This richly illustrated and clearly written undergraduate textbook captures 
the excitement and beauty of geometry. The approach is that of Klein 
in his Erlangen programme: a geometry is a space together with a set of 
transformations of the space. The authors explore various geometries: 
affine, projective, inversive, hyperbolic and elliptic. In each case they 
carefully explain the key results and discuss the relationships between the 
geometries. 

New features in this Second Edition include concise end-of-chapter 
summaries to aid student revision, a list of Further Reading and a list 
of Special Symbols. The authors have also revised many of the end-of- 
chapter exercises to make them more challenging and to include some 
interesting new results. Full solutions to the 200 problems are included 
in the text, while complete solutions to all of the end-of-chapter exercises 
are available in a new Instructors’ Manual, which can be downloaded from 
www.cambridge.org/9781 107647831. 


Praise for the First Edition 


‘To my mind, this is the best introductory book ever written on 
introductory university geometry... Not only are students introduced to a 
wide range of algebraic methods, but they will encounter a most pleasing 
combination of process and product.’ 

P. N. RUANE, MAA Focus 


NOILIGH GNOOKYS 


‘... an excellent and precisely written textbook that should be studied in 
depth by all would-be mathematicians.’ 
HANS SACHS, American Mathematical Society 
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Preface 


Geometry! For over two thousand years it was one of the criteria for recog- 
nition as an educated person to be acquainted with the subject of geometry. 
Euclidean geometry, of course. 

In the golden era of Greek civilization around 400 BC, geometry was studied 
rigorously and put on a firm theoretical basis — for intellectual satisfaction, the 
intrinsic beauty of many geometrical results, and the utility of the subject. 
For example, it was written above the door of Plato’s Academy ‘Let no-one 
ignorant of Geometry enter here!’ Indeed, Archimedes is said to have used the 
reflection properties of a parabola to focus sunlight on the sails of the Roman 
fleet besieging Syracuse and set them on flame. 

For two millennia the children of those families sufficiently well-off to be 
educated were compelled to have their minds trained in the noble art of rigor- 
ous mathematical thinking by the careful study of translations of the work of 
Euclid. This involved grasping the notions of axioms and postulates, the draw- 
ing of suitable construction lines, and the careful deduction of the necessary 
results from the given facts and the Euclidean axioms — generally in two- 
dimensional or three-dimensional Euclidean space (which we shall denote by 
R? and R?, respectively). Indeed, in the 1700s and 1800s popular publications 
such as The Lady’s and Gentleman’s Diary published geometric problems 
for the consideration of gentlefolk at their leisure. And as late as the 1950s 
translations of Euclid’s Elements were being used as standard school geometry 
textbooks in many countries. 

Just as nowadays, not everyone enjoyed Mathematics! For instance, the 
German poet and philosopher Goethe wrote that “Mathematicians are like 
Frenchmen: whatever you say to them, they translate into their own language, 
and forthwith it is something entirely different!’ 

The Golden Era of geometry came to an end rather abruptly. When the 
USSR launched the Sputnik satellite in 1957, the Western World suddenly 
decided for political and military reasons to give increased priority to its 
research and educational efforts in science and mathematics, and redeveloped 
the curricula in these subjects. In order to make space for subjects newly 
developed or perceived as more ‘relevant in the modern age’, the amount of 
geometry taught in schools and universities plummeted. Interest in geometry 
languished: it was thought ‘old-fashioned’ by the fashionable majority. 


Plato (c. 427-347 BC) 
was an Athenian 
philosopher who 
established a school of 
theoretical research (with 
a mathematical bias), 
legislation and 
government. 


Archimedes (c. 287-212 
BC) was a Greek 
geometer and physicist 
who used many of the 
basic limiting ideas of 
differential and integral 
calculus. 

Euclid (c. 325-265 BC) 
was a mathematician in 
Hellenistic Alexandria 
during the reign of 
Ptolemy I (323-283 BC), 
famous for his book The 
Elements. 

We give a careful 
algebraic definition of R?2 
and R? in Appendix 2. 


Johann Wolfgang von 
Goethe (1749-1832) is 
said to have studied all 
areas of science of his day 
except mathematics — for 
which he had no aptitude. 
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Nowadays it is being realized that geometry is still a subject of abiding 
beauty that provides tremendous intellectual satisfaction in return for effort put 
into its study, and plays a key underlying role in the understanding, develop- 
ment and applications of many other branches of mathematics. More and more 
universities are reintroducing courses in geometry, to give students a ‘feel’ 
for the reasons for studying various areas of mathematics (such as Topology), 
to service the needs of Computer Graphics courses, and so on. Geometry is 
having a revival! 

Since 1971, the Open University in the United Kingdom has taught math- 
ematics to students via specially written correspondence texts, and has tradi- 
tionally given geometry a central position in its courses. This book arises from 
those correspondence texts. 

We adopt the Klein approach to geometry. That is, we regard the various 
geometries as each consisting of an underlying set together with a group of 
transformations acting on that set. Those properties of the set that are not 
altered by any of the transformations are called the properties of that geometry. 

Following a historical review of the development of the various geometries, 
we look at conics (and at the related quadric surfaces) in Euclidean geometry. 
Then we address a whole series of different geometries in turn. First, affine 
geometry (that provides simple proofs of some results in Euclidean geome- 
try). Then projective geometry, which can be regarded as the most basic of 
all geometries; we divide this material into a chapter on projective lines and 
a chapter on projective conics. We then return to study inversive geometry, 
which provides beautiful proofs of many results involving lines and circles in 
Euclidean geometry. This leads naturally to the study of hyperbolic geometry 
in the unit disc, in which there are two lines through any given point that are 
parallel to a given line. Via the link of stereographic projection, this leads on 
to spherical geometry: a natural enough concept for a human race that lives 
on the surface of a sphere! Finally we tie things together, explaining how the 
various geometries are inter-related. 


Study Guide 


The book assumes a basic knowledge of Group Theory and of Linear Alge- 
bra, as these are used throughout. However, for completeness and students’ 
convenience we give a very rapid review of both topics in the appendices. 

The book follows many of the standard teaching styles of The Open Uni- 
versity. Thus, most chapters are divided into five sections (each often further 
divided into subsections); sections are numbered using two digits (such as 
‘Section 3.2’) and subsections using three digits (such as ‘Subsection 3.2.4’). 
Generally a section is considered to be about one evening’s hard work for an 
average student. 

We number in order the theorems, examples, problems and equations within 
each section. 

We use wide pages with margins in which we place various historical notes, 
cross-references, teaching comments and diagrams; the cross-references need 
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Topics in computer 
graphics such as ‘hidden’ 
surfaces and the shading 
of curved surfaces involve 
much mathematics. 
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Chapter 2 
Chapters 3 and 4 
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Chapter 6 


Chapter 7 
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Preface 


not be consulted by students unless they wish to remind themselves of some 
point on that topic, but the other margin notes should be read carefully. We use 
boxes in the main text to highlight definitions, strategies, and the statements of 
theorems and other key results. The end of the proof of a theorem is indicated 
by a solid symbol ‘Ml’, and the end of the solution of a worked example by a 


hollow symbol “L1’. Occasionally the text includes a set of ‘Remarks’; these are 


comments of the type that an instructor would give orally to a class, to clarify 
a definition, result, or whatever, and should be read carefully. There are many 
worked examples within the text to explain the concepts being taught, and it is 
important that students read these carefully as they contain many key teaching 
points; in addition, there is a good stock of in-text problems to reinforce the 
teaching, and solutions to these are given in Appendix 3. At the end of each 
chapter there are exercises covering the material of that chapter, some of which 
are fairly straight-forward and some are more challenging; solutions are not 
given to the exercises. 

Our philosophy is to provide clear and complete explanations of all geomet- 
ric facts, and to teach these in such a way that students can understand them 
without much external help. As a result, students should be able to learn (and, 
we hope, to enjoy) the key concepts of the subject in an uncluttered way. 

Most students will have met many parts of Chapter | already, and so can 
proceed fairly quickly through it. Thereafter it is possible to tackle Chapters 2 
to 4 or Chapters 5 and 6, in either order. It is possible to omit Chapters 7 or 8, 
if the time in a course runs short. 


Notation for Functions as Mappings 


Suppose that a function f maps some set A into some set B, and that it maps 
a typical point x of A onto some image point y of B. Then we say that A is the 
domain (or domain of definition) of f, B the codomain of f, and denote the 
function f as a mapping (or map) as follows: 


f:A->B 


xhy 


We often denote y by the expression f(x) to indicate its dependence on f 
and x. 
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Note that we use two 
different arrows here, to 
distinguish between the 
mapping of a set and the 
mapping of an element. 
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Changes in the Second Edition 


In addition to correcting typos and errors, the authors have changed the term 
‘gradient’ to ‘slope’, and avoided the use of ‘reversed square brackets’ — so 
that, for instance, the interval {x : 0 < x < 1} is now written as (0,1] rather 
than ]0,1]. Also, they have clarified the difference between a geometry and 
models of that geometry; in particular, the term ‘non-Euclidean’ geometry has 
now been largely replaced by ‘hyperbolic’ geometry, and the term ‘elliptic’ 
geometry has been introduced where appropriate. The problems and exercises 
have been revised somewhat, and more exercises included. Each chapter now 
includes a summary of the material in that chapter, and before the appendices 
there are now lists of symbols and suggestions for further reading. 

The authors have taken the opportunity to add some new material to enrich 
the reader’s diet: a treatment of conics as envelopes of tangent families, 
barycentric coordinates, Poncelet’s Porism and Ptolemy’s Theorem, and planar 
maps. Also, the treatment of a number of existing topics has been significantly 
changed: the geometric interpretation of projective transformations, the anal- 
ysis of the formula for hyperbolic distance, and the treatment of asymptotic 
d-triangles. 

The authors appreciate the warm reception of the first edition, and have tried 
to take on board as many as possible of the helpful comments received. Special 
thanks are due to John Snygg and Jonathan I. Hall for invaluable comments and 
advice. 


Instructors’ Manual 


Complete solutions to all of the end-of-chapter exercises are available in an 
Instructors’ Manual, which can be downloaded from www.cambridge.org/ 
9781107647831. 
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Solutions to the exercises 
appear in an Instructors’ 
Manual available from the 
publisher. 


Introduction: Geometry 
and Geometries 


Geometry is the study of shape. It takes its name from the Greek belief that 
geometry began with Egyptian surveyors of two or three millennia ago mea- 
suring the Earth, or at least the fertile expanse of it that was annually flooded 
by the Nile. 

It rapidly became more ambitious. Classical Greek geometry, called 
Euclidean geometry after Euclid, who organized an extensive collection of 
theorems into his definitive text The Elements, was regarded by all in the early 
modern world as the true geometry of space. Isaac Newton used it to formu- 
late his Principia Mathematica (1687), the book that first set out the theory 
of gravity. Until the mid-19th Century, Euclidean geometry was regarded as 
one of the highest points of rational thought, as a foundation for practical 
mathematics as well as advanced science, and as a logical system splendidly 
adapted for the training of the mind. We shall see in this book that by the 1850s 
geometry had evolved considerably — indeed, whole new geometries had been 
discovered. 

The idea of using coordinates in geometry can be traced back to Apollo- 
nius’s treatment of conic sections, written a generation after Euclid. But their 
use in a systematic way with a view to simplifying the treatment of geome- 
try is really due to Fermat and Descartes. Fermat showed how to obtain an 
equation in two variables to describe a conic or a straight line in 1636, but his 
work was only published posthumously in 1679. Meanwhile in 1637 Descartes 
published his book Discourse on Method, with an extensive appendix enti- 
tled La Géometrie, in which he showed how to introduce coordinates to solve 
a wide variety of geometrical problems; this idea has become so central a 
part of mathematics that whole sections of La Géometrie read like a modern 
textbook. 

A contemporary of Descartes, Girard Desargues, was interested in the ideas 
of perspective that had been developed over many centuries by artists (anx- 
ious to portray three-dimensional scenes in a realistic way on two-dimensional 
walls or canvases). For instance, how do you draw a picture of a building, 
or a staircase, which your client can understand and commission, and from 
which artisans can deduce the correct dimensions of each stone? Desargues 
also realized that since any two conics can always be obtained as sections of 
the same cone in R’, it is possible to present the theory of conics in a unified 


The word comes from the 
Greek words geo (Earth) 
and metria (measuring). 


Isaac Newton 
(1643-1727) was an 
English astronomer, 
physicist and 
mathematician. He was 
Professor of Mathematics 
at Cambridge, Master of 
the Royal Mint, and 
successor of Samuel 
Pepys as President of the 
Royal Society. 


Apollonius of Perga 

(c. 255-170 BC) was a 
Greek geometer, whose 
only surviving work is a 
text on conics. 


Pierre de Fermat 
(1601-1665) was a French 
lawyer and amateur 
mathematician, who 
claimed to have a proof of 
the recently proved 
Fermat’s Last Theorem in 
Number Theory. 


René Descartes 
(1596-1650) was a French 
scientist, philosopher and 
mathematician. He is also 
known for the phrase 
‘Cogito, ergo sum’ (I 
think, therefore I am). 
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way, using concepts which later mathematicians distilled into the notion of 
the cross-ratio of four points. Desargues’ discoveries came to be known as 
projective geometry. 

Blaise Pascal was the son of a mathematician, Etienne, who attended a group 
of scholars frequented by Desargues. He heard of Desargues’s work from his 
father, and quickly came up with one of the most famous results in the geom- 
etry of conics, Pascal’s Theorem, which we discuss in Chapter 4. By the late 
19th century projective geometry came to be seen as the most basic geometry, 
with Euclidean geometry as a significant but special case. 

At the start of the 19th century the world of mathematics began to change. 
The French Revolution saw the creation of the Ecole Polytechnique in Paris 
in 1794, an entirely new kind of institution for the training of military engi- 
neers. It was staffed by mathematicians of the highest calibre, and run for 
many years by Gaspard Monge, an enthusiastic geometer who had invented a 
simple system of descriptive geometry for the design of forts and other mili- 
tary sites. Monge was one of those rare teachers who get students to see what is 
going on, and he inspired a generation of French geometers. The Ecole Poly- 
technique, moreover, was the sole entry-point for any one seeking a career 
in engineering in France, and the stranglehold of the mathematicians ensured 
that all students received a good, rigorous education in mathematics before 
entering the specialist engineering schools. Thus prepared they then assisted 
Napoleon’s armies everywhere across Europe and into Egypt. 

One of the Ecole’s former students, Jean Victor Poncelet, was taken prisoner 
in 1812 in Napoleon’s retreat from Moscow. He kept his spirits up during a 
terrible winter by reviewing what his old teacher, Monge, had taught him about 
descriptive geometry. This is a system of projections of a solid onto a plane — 
or rather two projections, one vertically and one horizontally (giving what are 
called to this day the plan and elevation of the solid). Poncelet realized that 
instead of projecting “from infinity’ so to speak, one could adapt Monge’s ideas 
to the study of projection from a point. In this way he re-discovered Desargues’ 
ideas of projective geometry. During his imprisonment he wrote his famous 
book Traité des propriétés projectives des figures outlining the foundations of 
projective geometry, which he extensively rewrote after his release in 1814 and 
published in 1822. 

Around the same time that projective geometry was emerging, mathemati- 
cians began to realize that there was more to be said about circles than they 
had previously thought. For instance, in the study of electrostatics let £; and 
£9 be two infinitely long parallel cylinders of opposite charge. Then the inter- 
section of the surfaces of equipotential with a vertical plane is two families of 
circles (and a single line), and a point charge placed in the electrostatic field 
moves along a circular path through a specific point inside each cylinder, at 
right angles to circles in the families. The study of properties of such fami- 
lies of circles gave rise to a new geometry, called inversive geometry, which 
was able to provide particularly striking proofs of previously known results in 
Euclidean geometry as well as new results. 


Girard Desargues 
(1591-1661) was a French 
architect. 

We deal with these ideas 
in Chapters 4 and 5. 


Blaise Pascal 
(1623-1662) was a French 
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physicist and philosopher. 


Gaspard Monge 
(1746-1818) was a French 
analyst and geometer. A 
strong republican and 
supporter of the 
Revolution, he was French 
Minister of the Navy in 
1792-93, but deprived of 
all his honours on the 
restoration of the French 
monarchy. 


Jean Victor Poncelet 
(1788-1867) followed a 
career as a military 
engineer by becoming 
Professor of Mechanics at 
Metz, where he worked on 
the efficiency of turbines. 


vertical 
projection 


<I\---£ 
horizontal 
p--- ait 
b 


' 1! projection 


0: Introduction: Geometry and Geometries 


In inversive geometry mathematicians had to add a ‘point at infinity’ to the 
plane, and had to regard circles and straight lines as equivalent figures under 
the natural mappings, inversions, as these can turn circles into lines, and vice- 
versa. Analogously, in projective geometry mathematicians had to add a whole 
‘line at infinity’ in order to simplify the geometry, and found that there were 
projective transformations that turned hyperbolas into ellipses, and so on. So 
mathematicians began to move towards thinking of geometry as the study of 
shapes and the transformations that preserve (at least specified properties of) 
those shapes. 

For example, there are very few theorems in Euclidean geometry that depend 
on the size of the figure. The ability to make scale copies without altering 
‘anything important’ is basic to mathematical modelling and a familiar fact 
of everyday life. If we wish to restrict our attention to the transformations 
that preserve length, we deal with Euclidean geometry, whereas if we allow 
arbitrary changes of scale we deal with similarity geometry. 

Another interesting geometry was discovered by Mobius in the 1820s, in 
which transformations of the plane map lines to lines, parallel lines to parallel 
lines, and preserve ratios of lengths along lines. He called this geometry affine 
geometry because any two figures related by such a transformation have a like- 
ness or affinity to one another. This is the geometry appropriate, in a sense, to 
Monge’s descriptive geometry, and the geometry that describes the shadows of 
figures in sunlight. 

Since the days of Greek mathematics, with a stimulus provided by the needs 
of commercial navigation, mathematicians had studied spherical geometry too; 
that is, the geometry of figures on the surface of a sphere. Here geometry 
is rather different from plane Euclidean geometry; for instance the area of 
a triangle is proportional to the amount by which its angle sum exceeds zr, 
and there is a nice generalization of Pythagoras’ Theorem, which says that 
in a right-angled triangle with sides a,b and the hypotenuse c, then cosc = 
cosa - cosb. It turns out that there is a close connection between spherical 
geometry and inversive geometry. 

For nearly two millennia mathematicians had accepted as obvious the 
Parallel Postulate of Euclid: namely, that given any line @ and any point P 
not on £, there is a unique line m in the same plane as P and £ which passes 
through P and does not meet £. Indeed much effort had been put into deter- 
mining whether this Postulate could be deduced from the other assumptions 
of Euclidean geometry. In the 1820s two young and little-known mathemati- 
cians, Bolyai in Hungary and Lobachevskii in Russia, showed that there 
were perfectly good so-called ‘non-Euclidean geometries’, namely hyper- 
bolic geometry and elliptic geometry, that share all the initial assumptions of 
Euclidean geometry except the parallel postulate. 

In hyperbolic geometry given any line @ and any point P not on £, there are 
infinitely many lines in the same plane as P and £ which pass through P and do 
not meet £; in elliptic geometry all lines intersect each other. However, it still 
makes sense in both hyperbolic and elliptic geometries to talk about the length 
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Janos Bolyai (1802-1860) 
was an officer in the 
Hungarian Army. 


Nicolai Ivanovich 
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Russian geometer who 
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University of Kazan. 
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of line segments, the distance between points, the angles between lines, and 

so forth. Around 1900 Poincaré did a great deal to popularise these geometries Jules Henri Poincaré 
by demonstrating their applications in many surprising areas of mathematics, (1854-1912) was a 
such as Analysis. prolific French 


Wot a é . thematician, physicist, 
By 1870, the situation was that there were many geometries: Euclidean, een kegs 
astronomer and 


philosopher at the 
ematicians have of coping with the growth of their subject is to re-define it so University of Paris. 


that different branches of it become branches of the same subject. This was 
done for geometry by Klein, who developed a programme (the Erlangen Pro- Christian Felix Klein 


affine, projective, inversive, hyperbolic and elliptic geometries. One way math- 


gramme) for classifying geometries. His elegant idea was to regard a geometry (1849-1925) was a 
German algebraist, 


‘ ee ; ter, topologist and 
ties of figures that are not altered by any transformation in the group are their ee 
physicist; he became a 


geometrical properties. professor at the University 
For example, in two-dimensional Euclidean geometry the space is the plane _ of Erlangen at the 

and the group is the group of all length-preserving transformations of the plane remarkable age of 22. 

(or isometries). In projective geometry the space is the plane enlarged (in a way 

we make precise in Chapter 6) by a line of extra points, and the group is the 


as a space together with a group of transformations of that space; the proper- 


group of all continuous transformations of the space that preserve cross-ratio. 
Klein’s approach to a geometry involves three components: a set of points 
(the space), a set of transformations (that specify the invariant properties — for 
example, congruence in Euclidean geometry), and a group (that specifies how 
the transformations may be composed). The transformations and their group 
are the fundamental components of the geometry that may be applied to differ- 
ent spaces. A model of a geometry is a set which possesses all the properties of 
the geometry; two different models of any geometry will be isomorphic. There 
may be several different models of a given geometry, which have different For example, you will 
advantages and disadvantages. Therefore, we shall use the terms ‘geometry’ meet two models of 
and ‘model (of a geometry)’ interchangeably whenever we think that there is }YPerbolic geometry. 
no risk of confusion. 
In fact as Klein was keen to stress, most geometries are examples of pro- 
jective geometry with some extra conditions. For example, affine geometry 
emerges as the geometry obtained from projective geometry by selecting a line 
and considering only those transformations that map that line to itself; the line 
can then be thought of as lying “at infinity’ and safely ignored. The result was 
that Klein not only had a real insight into the nature of geometry, he could even 
show that projective geometry was almost the most basic geometry. 
This philosophy of geometry, called the Kleinian view of geometry, is the 
one we have adopted in this book. We hope that you will enjoy this introduction 
to the various geometries that it contains, and go on to further study of one of 
the oldest, and yet most fertile, branches of mathematics. 


Conics 


The study of conics is well over 2000 years old, and has given rise to some of 
the most beautiful and striking results in the whole of geometry. 

In Section 1.1 we outline the Greek idea of a conic section — that is, a conic 
as defined by the curve in which a double cone is intersected by a plane. We 
then look at some properties of circles, the simplest of the non-degenerate 
conics, such as the condition for two circles to be orthogonal and the equations 
of the family of all circles through two given points. 

We explain the focus—directrix definition of the parabola, ellipse and hyper- 
bola, and study the focal-distance properties of the ellipse and hyperbola. 
Finally, we use the so-called Dandelin spheres to show that the Greek conic 
sections are just the same as the conics defined in terms of a focus and a 
directrix. 

In Section 1.2 we look at tangents to conics, and the reflection properties of 
the parabola, ellipse and hyperbola. It turns out that these are useful in prac- 
tical situations as diverse as anti-aircraft searchlights and astronomical optical 
telescopes! We also see how we can construct each non-degenerate conic as 
the ‘envelope’ of lines in a suitably-chosen family of lines. 

The equations of conics are all second degree equations in x and y. In 
Section 1.3 we show that the converse result holds — that is, that every sec- 
ond degree equation in x and y represents a conic. We also find an algorithm 
for determining from its equation in x and y which type of non-degenerate 
conic a given second degree equation represents, and for finding its principal 
features. 

The analogue in R? of a plane conic in R? is a quadric surface, specified 
by a suitable second degree equation in x, y and z. A well-known example 
of a quadric surface is the cooling tower of an electricity generating station. 
In Section 1.4 we find an algorithm for identifying from its equation which 
type of non-degenerate quadric a given second degree equation in x, y and z 
represents. We also discover that two of the non-degenerate quadric surfaces 
can be generated by two different families of straight lines, and that this feature 
is of practical importance. 


That is, they intersect at 
right angles. 


We use the notation R? 
and R? to denote 
2-dimensional and 
3-dimensional Euclidean 
space, respectively. 


1.1 Conic Sections and Conics 


1.1.1 Conic Sections 


Conic Section is the name given to the shapes that we obtain by taking different 
plane slices through a double cone. The shapes that we obtain from these cross- 
sections are as drawn below. 


y Ya 
oO]. x 7) t 
1. single point 2. single line 
y 
3 ya 
7 : C45 x 
4. parabola 5. ellipse 6. hyperbola 


Notice that the circle shown in slice 7 can be regarded as a special case of 
an ellipse. 

Notice, also, that the ellipse and the hyperbola both have a centre; that is, 
there is a point C such that rotation about C through an angle z is a symmetry 
of the conic. For example, for the ellipse and hyperbola illustrated above, the 
centre is in fact just the origin. On the other hand, the parabola does not have 
a centre. 

In Subsection 1.1.5 we shall verify that the curves, the ‘conic sections’, 
obtained by slicing through a double cone are exactly the same curves, the 
‘conics’, obtained as the locus of points in the plane whose distance from 
a fixed point is a constant multiple of its distance from a fixed line. As a 
result, we often choose not to distinguish between the terms ‘conic section’ 
and ‘conic’! 

We use the term non-degenerate conics to describe those conics that are 
parabolas, ellipses or hyperbolas; and the term degenerate conics to describe 
the single point, single line and pair of lines. 

In this chapter we study conics for their own interest, and we will meet them 
frequently throughout our study of geometry as the book progresses. 


1.1.2 Circles 


The first conic that we investigate is the circle. Recall that a circle in R? is the 
set of points (x, y) that lie at a fixed distance, called the radius, from a fixed 
point, called the centre of the circle. We can use the techniques of coordinate 
geometry to find the equation of a circle with given centre and radius. 


1: Conics 


It is thought that the 
Greek mathematician 
Menaechmus discovered 
the conic sections around 
350 BC. 


7. Circle 


P(x, y) 


BY 


Conic Sections and Conics 


Let the circle have centre C(a, b) and radius r. Then, if P(x, y) is an arbi- 
trary point on the circumference of the circle, the distance CP equals r. It 
follows from the formula for the distance between two points in the plane that 


r? = (x — a)” + (y —b)’. ()) 
If we now expand the brackets in equation (1) and collect the corresponding 
terms, we can rewrite equation (1) in the form 
mee y — 2ax — 2by + (a* +b? —r*) =0. 


Then, if we write f for —2a, g for —2b and h for a* +b? —r?, this equation 
takes the form 
x+y’ +t egyth=0. (2) 
It turns out that in many situations, however, equation (1) is more useful 
than equation (2) for determining the equation of a particular circle. 


Theorem 1 The equation of a circle in IR? with centre (a,b) and radius 
r 1s j . . 
(* —a)°+(y—b) =r’. 


For example, it follows from this formula that the circle with centre (—1, 2) 
and radius 3 has equation 


2 
(x +1? + -2)? = (V3) ; 
this can be simplified to give 


x*+2x+1+y?—4y4+453, 
or 
x+y? 42x —4y4+2=0. 
Problem 1 Determine the equation of each of the circles with the 
following centre and radius: 


(a) centre the origin, radius 1; 
(b) centre the origin, radius 4; 
(c) centre (3, 4), radius 2; 
(d) centre (3, 4), radius 3. 


Problem 2 Determine the condition on the numbers /, g and h in the 


equation 
e+ytfeteyth =0 


for the circle with this equation to pass through the origin. 
We have seen that the equation of a circle can be written in the form 
er +ytfrteyth=0. (2) 


In the opposite direction, given an equation of the form (2), can we deter- 
mine whether it represents a circle? If it does represent a circle, can we 
determine its centre and radius? 


Here we use the Distance 
Formula for the distance d 
between two points 

(x1, 1), (x2, y2) in R?: 


d? = (x, — x2)" 
+(y1 — y2)”. 
Note here that the 


coefficients of x? and y? 
are equal. 


ca) 


For example, consider the set of points (x, y) in the plane that satisfy the 
equation: 


x+y? —4x+6y+9=0. (3) 


In order to transform equation (3) into an equation of the form (1), we use the 
technique called ‘completing the square’ — we rewrite the terms that involve 
only xs and the terms that involve only ys as follows: 


x? —4x = (x —2)* -4, 

y? + 6y = (y +3)? -9. 
Substituting these expressions into equation (3), we obtain 

(x — 2)? + (y +3)? =4. 


It follows that the equation represents a circle whose centre is (2, —3) and 
whose radius is 2. 

In general, we can use the same method of ‘completing the square’ to rewrite 
the equation 


x*+y*+fce+ sy th =0 
in the form 
1 2 1 2 1 ¢2 1 ,2 
(x+4/) + (y+ 4s) =if + 78 —/A, (4) 


from which we can ‘read off’ the centre and radius. 


Theorem 2 An equation of the form 
w+ytfe+egyth=0 
represents a circle with 
centre (-2f, —38) and radius if? + 78° —h, 


provided that if? + 8° —h>0. 


Remark 

It follows from equation (4) above that if i fi + i g? —h <0, then there are 
no points (x, y) that satisfy the equation x? + y? + fe + gy +h = 0; and if 
i f+ 18° —h = 0, then the given equation simply represents the single point 


Problem 3 Determine the centre and radius of each of the circles 
given by the following equations: 


(a) x7 + y?-—2x—6y+1=0; (b)3x*+3y? — 12x —48y =0. 


1: Conics 


Note that in equation (3) 
the coefficients of x2 and 
y? are both 1. 


Note that —2 is half the 
coefficient of x, and +3 is 
half the coefficient of y, in 
equation (3). 


We can ‘read off’ the 
centre and radius of the 
circle from this equation. 


Here we start with the 
coefficients of x? and y2 
both equal (to 1). 
Otherwise the equation 
cannot be reformulated in 
the form (1). 


Conic Sections and Conics 


Problem 4 Determine the set of points (x, y) in R? that satisfies each 
of the following equations: 


(a) x7+y? +x+y+1=0; 
(b) x7 + y?—2x +4y4+5=0; 
(c) 2x* + 2y* +x -—3y-—5=0. 


Orthogonal Circles 


We shall sometimes be interested in whether two intersecting circles are 
orthogonal: that is, whether they meet at right angles. The following result 
answers this question if we know the equations of the two circles. 


Theorem 3 Orthogonality Test 
Two intersecting circles C; and C2 with equations 


x+y? + fixtgiyt+h;=0 and 
x+y? + fox + py tho =0, 
respectively, are orthogonal if and only if 


ili ar Big = 2ai =F 129). 


Proof The circle C; has centre A = (-4/1,-481) and radius r)} = 


4/ ify + 481 — hy; the circle Cz has centre B = (-34, —}82) and radius 


ro = a f5 + 783 — ho. 

Let P be one of their points of intersection, and look at the triangle AABP. 
If the circles meet at right angles, then the line AP is tangential to the circle 
C2, and is therefore at right angles to the line BP. So the triangle AABP is 
right-angled, and we may apply Pythagoras’ Theorem to it to obtain 


AP? + BP? = AB’. (5) 


Conversely, if equation (5) holds, then AABP must be a right-angled triangle 
and the circles must meet at right angles. 


Now 
AP =r} =4f,+48;—M1 and 
ppt ara lft ded — mr 
Also 


2 2 
AB* = (Ca = 3 f2) + (581 = $22) 


= Gri —shifet 1/3) ao (isi — 581g2+ 483). 


For example, in 
Chapters 5 and 6. 


You met these formulas in 
Theorem 2. 


We use the symbol A to 
indicate a triangle. 
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Substituting for AP*, BP? and AB* into equation (5), and cancelling 
common terms, we deduce that equation (5) is equivalent to 


hy — hy =—-3 fifo — 38182, 
that is, 
Si fo + 8182 = 2(h1 + h2). 
This is the required result. | 


Problem 5 Determine which, if any, of the following pairs of inter- 
secting circles are mutually orthogonal. 


(a) C) = {(x,y): x? +y?—4x—4y+7=0} and 
Co = { (x,y): x7 + y? +2x —8y +5=0} 

(b) Ci = {(x, y) x? +y? 43x —6y+5=0} and 
Co = { (x,y) : 3x2 + 3y? +4x + y—15 =O}. 


Circles through Two Points 
We shall also be interested later in the family of circles through two given 
points. So, let two circles C, and C2 with equations 
x +y*t fixtgiy+h;=0 and 
x+y? + fox + gay tho =0 (6) 
intersect at the distinct points P and Q, say. Then, if k 4 —1, the equation 
eP+y + fixtgiytmtk+y + fxtgyt+in)=0 (7) 


represents a circle since it is a second degree equation in x and y with equal 
(non-zero) coefficients of x? and y? and with no terms in xy. This circle 
passes through both P and Q; for the coordinates of P and Q both satisfy 
the equations in (6) and so must satisfy equation (7). 

If k = —1, equation (7) is linear in x and y, and so represents a line; since 
P and Q both lie on it, it must be the line through P and Q. 

Conversely, given any point FR in the plane that does not lie on the circle 
C2 we can substitute the coordinates of R into equation (7) to find the unique 
value of k such that the circle with equation (7) passes through R. We can think 
of the circle C2 as corresponding to the case ‘k = oo’ of equation (7). For, if 
we rewrite equation (7) in the form 


1 
erty t+ fixtaiy tht ty’ + foxt gry tho =0 (8) 


and let k —> oo, then 1/k — 0 and equation (8) becomes the equation of C2. 


Theorem 4 Let C; and C2 be circles with equations 
x+y + fixtgiyt+hi=0 and 
x+y? + fox + goyth2 =0 


1: Conics 


Section 5.5 


This is possible because, 
since R does not lie on 
C2, the term in the bracket 
in (7) does not vanish 

at R. 


Conic Sections and Conics 


that intersect at distinct points P and Q. Then the line and all circles (other 
than C2) through P and Q have an equation of the form 


By ee ery eee Gs ey on ey 0 


for some number k. 
If k # —1, this equation is one of the circles; if k = —1, this is the 
equation of the line. 


Example 1 Find the equation of the circle that passes through (1, 2) and the 
points of intersection of the circles 


x? +y?—3x+4y—-1=0 and x? +y?+4+3x-—3y+3=0. 
Solution By Theorem 4, the required equation is of the form 
x? 4 y? 3x + 4y— 14k (x? +? + $x -3y +3) =0 (9) 
for some number k. Since (1, 2) must satisfy this equation, it follows that 
1+4-34+8-1+k(14+44+$-6+3) =0, 


so that k = —3. Substituting k = —3 back into equation (9), we deduce that 
the equation of the required circle is 


x? 4 y?—3x-+4y— 1-3 (x? +y? + $x —3y 4 §) =0, 


which we can simplify to the form 


Ax? + 4y? + 21x —26y+11=0. 


Problem 6 Find the equation of the line through the points of inter- 
section of the circles 


x? 4y*-—3x+4y—-1=0 and 
2x? + 2y? + 5x —6y+3=0. 


1.1.3 Focus-Directrix Definition of the Non-Degenerate Conics 


Earlier we defined the conic sections as the curves of intersection of a double 
cone with a plane. We have seen that the circle can be defined in a different 
way: as the set of points at a fixed distance from a fixed point. 

Here we give a method for constructing the other non-degenerate conics, 
the parabola, ellipse and hyperbola, as sets of points that satisfy a some- 
what similar condition involving distances. Later we shall give a careful proof 


Subsection 1.1.1 
Subsection 1.1.2 


Subsection 1.1.5 
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12 1: Conics 


that each non-degenerate conic section is a non-degenerate (plane) conic, 

and vice-versa. Theorem 4 of 
The three non-degenerate conics (the parabola, ellipse and hyperbola) can Subsection 4.1.4 

be defined as the set of points P in the plane that satisfy the following condi- 

tion: The distance of P from a fixed point (called the focus of the conic) is a 

constant multiple (called its eccentricity, e) of the distance of P from a fixed 

line (called its directrix). 
The different conics arise according to the value of the eccentricity: 


Eccentricity A non-degenerate conic is an ellipse if 0 < e < 1, a parabola When e = 0, the ellipse is 


if e = 1, ora hyperbola if e > 1. actually a circle; the focus 
is the centre of the circle, 


and the directrix is ‘at 
Parabola (e = 1) infinity’. 
A parabola is defined to be the set of points P in the plane whose distance 
from a fixed point F is equal to their distance from a fixed line d. We obtaina = y, 
parabola in standard form if we choose 


1. the focus F to lie on the x-axis, and to have coordinates (a,0), a > 0; Fa, 0) 


2. the directrix d to be the line with equation x = —a. 


focus 


Notice in particular that the origin O(0,0) lies on the parabola since it is “"“"*“ 
equidistant from F and d. 

Let P(x, y) be an arbitrary point on the parabola, and let M be the foot of 
the perpendicular from P to the directrix. Since FP = PM, by the definition 
of the parabola, it follows that FP? = PM?; we may rewrite this equation in 


terms of coordinates as 
(x —a) + y= (@ +a)’. 
Multiplying out the brackets we get 
x? =a 4 77 = x7 + 2ax + a?, 


which simplifies to the equation y* = 4ax. 

Notice that each point with coordinates (ar, 2at), where t € R, lies on the We use the notation R to 
parabola, since (2at)? = 4a-at*. Conversely, we can write the coordinates of denote the ‘real numbers’ 
each point on the parabola in the form (ar, 2at). For if we choose t = y/(2a),  ™ anaes 
then y = 2at and 


2 
: x (from the equation vo = 4ax) 
a 
2at)* 
_ (2at) - at’, 
4a 


as required. It follows that there is a one—one correspondence between the real 
numbers ¢ and the points of the parabola. 


Conic Sections and Conics 


We summarize the above facts as follows. 


Parabola in Standard Form A parabola in standard form has equation 


2} 


y =4ax, wherea > 0. 


It has focus (a,0) and directrix x = 
parametric equations 


—a; and it can be described by the 


x =atr’, y=2at (t ER). 

We call the x-axis the axis of the parabola in standard form, since the 
parabola is symmetric with respect to this line, and we call the origin the vertex 
of a parabola in standard form, since it is the point of intersection of the axis 


of the parabola with the parabola. A parabola has no centre. 


Example2 This question concerns the parabola E with equation y” = 2x and 
parametric equations x = 50, y=t(teER). 


(a) Write down the focus, vertex, axis and directrix of E. 

(b) Determine the equation of the chord that joins distinct points P and Q on 
E with parameters ¢, and fz, respectively. Determine the condition on rt 
and f2 such that the chord PQ passes through the focus of E. 


Solution 


(a) The parabola E is the parabola in standard form where 4a = 2, ora = 5. 
It follows that the focus of E is (5. 0), its vertex is (0, 0), its axis is the 
x-axis, and the equation of its directrix is x = — 5 

(b) The coordinates of P and Q are (417, 1) and (44, n), respectively. So, 


if - x i. the slope (or gradient, as it is sometimes called) of PQ is 
given by 


yo fH) 2 
m= —= — a 
(aan) l(2— 2 
sti —9ty (47-4) th 
Since (34, 1) lies on the line PQ, it follows that the equation of PQ is 


pe ( 
1= x 
4 3 


Multiplying both sides by ft; + 2, we get 


1, 
L 3 
5) 


(t1 + 2)(y — 1) = 2x — 13, 


so that 


2 2 
(t) +h)y —t] — tt = 2x — ty, 
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‘Centre’ was defined in 
Subsection 1.1.1. 


We generally use the letter 
E to denote a conic. 


Such a chord is called 
focal chord. 


P (Stith) 


> 
F (0) x 
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or 
(t) + f)y = 2x + th. (10) 
If, however, ie = i then since t} # ft we have t) = —f2. Thus PQ is 
parallel to the y-axis, and so has equation x = 5; so in this case too, PQ 
has equation given by (10). 
The chord PQ with equation (10) passes through the focus (5. 0) if 
(tj + t2)0 = 1 + tyf2; in other words, if tjt = —1. 


Problem 7 This question concerns the parabola E with equation 
y? = x and parametric equations x = t?, y = t (t € R). 


(a) Write down the focus, vertex, axis and directrix of E. 

(b) Determine the equation of the chord that joins distinct points P and 
Q on E with parameters t, and f2, respectively. 

(c) Determine the condition on ft; and fg (and so on P and Q) that the 
focus of E is the midpoint of the chord PQ. 


Ellipse (0 < e < 1) 


We define an ellipse with eccentricity zero to be a circle. We have already 
discussed circles. 

We define an ellipse with eccentricity e (where 0 < e < 1) to be the set 
of points P in the plane whose distance from a fixed point F is e times their 
distance from a fixed line d. We obtain such an ellipse in standard form if we 
choose 


1. the focus F to lie on the x-axis, and to have coordinates (ae,0), a > 0; 
2. the directrix d to be the line with equation x = a/e. 


Let P(x, y) be an arbitrary point on the ellipse, and let M be the foot of the 
perpendicular from P to the directrix. Since FP = e - PM, by the definition of 
the ellipse, it follows that FP? = e? - PM’; we may rewrite this equation in 
terms of coordinates as 


2 
(x -— ae)" + y? =e (« _ =) = (ex — ay’. 
e 
Multiplying out the brackets we get 


2 


qo = Dae ae y =e 


x? — 2aex + a’, 
which simplifies to the equation 
e(1-e)+y? =a? (1-e) 


or 


1: Conics 


Subsection 1.1.2 


x=ale 
directrix 
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Substituting b for av1 —e?, so that b? = a*(1— e*), we obtain the 
standard form of the equation of the ellipse 


Notice that this equation is symmetrical in x and symmetrical in y, so that 
the ellipse also has a second focus F’(—ae,0) and a second directrix d’ with 
equation x = —a/e. 

The ellipse intersects the axes at the points (+a,0) and (0,+5). We call 
the segment joining the points (+a,0) the major axis of the ellipse, and the 
segment joining the points (0,5) the minor axis of the ellipse. Since b < a, 
the minor axis is shorter than the major axis. The origin is the centre of this 


ellipse. 
Notice that each point with coordinates (a cost,bsint) lies on the ellipse, 
since 
(a cos t)? (b sin t)? 2 a) 
72 7p =cos*t+sin*t=1. 


Then, just as for the parabola, we can check that 


x=acost, y=bsint (t € (-7,7]) 
gives a parametric representation of the ellipse. 
We now summarize the above facts about ellipses (including circles) as 


follows. 


Ellipse in Standard Form An ellipse in standard form has equation 


2) 
Seat where a > b > 0, b? =a? (1-e?) 0 <e <1. 
a 


It can be described by the parametric equations 
@e(Sr- zr): 


L=ICOSi, VS pom 


If e > 0, it has foci (ae, 0) and directrices x = ta/e. 


Example 3 Let PQ be an arbitrary chord of the ellipse with equation 


2 2 
x y 
—+ 
a2 


Let M be the midpoint of PQ. Prove that the following expression is 
independent of the choice of P and Q: 


slope of OM x slope of PQ. 
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Since 0 < e < 1, we have 
thatO <b <a. 


directrix 


minor axis 


directrix 
x= -ale major axis centre x= ale 


Sometimes it is 
convenient to assume that 
t € [0, 27), for instance, 
instead of (—zr, 27]. Notice 
our notation for intervals: 


(p,q). [p. 4], [p. 9), (P. @] 


denote those real numbers 
x for which p <x <q, 
pPsx<g,psx <q, 
p <x <q, respectively. 


Another parametric 
representation of this 
ellipse is 
ee iy 
=a tae y 
yep 
; 1+?’ 


teR. 


xia +y/b = 1 
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Solution Let P and Q have the parametric coordinates (a cos f1, b sin t1) 
and (acost2,bsinf2), respectively. It follows that M has coordinates 
(5 (cos t) + cos fz), 4 (sin t) +sinf2)). 


Now, 
b(sint int 
the slope of OM = pica aan 
a(cos ty + COS f2) 
and 
b(sin t; — sin tf) 
the slope of PQ = ———————_—., 
a(cos ty — COS f2) 
so 


slope of OM x slope of PQ 
b(sinty + sinf2)  b(sint, — sinf) 


a(cost, +cosf2) a(cost, — cos fz) 


b? sin? t) — sin? to 


a2 cos? ty — cos? fa 


_ b? sin” t) — sin” to In general, 
= 0s ay) 2 
a 1 — sin’ t,) — (1 — sin‘ t : 
( 1) ( 2) cos? @ = 1 — sin? 6. 
b2 
— oe 


which is independent of the values of t; and fo. 


Problem 8 Let P be an arbitrary point on the ellipse with equation yt 
y 


= + x= 1 and focus F (ae, 0). Let M be the midpoint of FP. Prove that 
M lies on an ellipse whose centre is midway between the origin and F’. 


P (acost, b sin t) 


F (ae, 0) x 


xia +ly 1b = 1 


Hyperbola (e > 1) 

A hyperbola is the set of points P in the plane whose distance from a fixed 
point F is e times their distance from a fixed line d, where e > 1. We obtain a 
hyperbola in standard form if we choose 


1. the focus F to lie on the x-axis, and to have coordinates (ae,0), a > 0; 
2. the directrix d to be the line with equation x = a/e. 


Let P(x, y) be an arbitrary point on the hyperbola, and let M be the foot of 
the perpendicular from P to the directrix. Since FP = e- PM, by the definition 
of the hyperbola, it follows that FP” = e* - PM’; we may rewrite this equation 
in terms of coordinates as 


2 
(x — ae) + y? =e? (x - *) 
e 


x=a/e 
directrix 


= (ex —a)’. 


Multiplying out the brackets we get 


2 


x* — 2aex + ae” -- y? = e*x” — 2aex + a, 
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which simplifies to 


x7 (2? -1)-y? =a? (e?-1), 


or 


Substituting b for ave? — 1, so that b? = a? (e? = I); we obtain the standard 
form of the equation of the hyperbola 

2 
~ = 

Notice that this equation is symmetrical in x and symmetrical in y, so that 
the hyperbola also has a second focus F’(—ae,0) and a second directrix d’ 
with equation x = —a/e. 

The hyperbola intersects the x-axis at the points (+a,0). We call the seg- 
ment joining the points (4a,0) the major axis or transverse axis of the 
hyperbola, and the segment joining the points (0, +b) the minor axis or conju- 
gate axis of the hyperbola (notice that this is NOT a chord of the hyperbola). 
The origin is the centre of this hyperbola. 


Notice also that each point with coordinates (a sect, b tant), where ¢ is not 
an odd multiple of 27/2, lies on the hyperbola, since 


a’ sec*t  b* tan? t 1 


az b2 ~ 


Then, just as for the parabola, we can check that 


x=asect, y=btant (¢ € (—m/2,7/2) U (27/2, 37/2)) 
gives a parametric representation of the hyperbola. 

Two other features of the shape of the hyperbola stand out. Firstly, the 
hyperbola consists of two separate curves or branches. 


Secondly, the lines with equations 


x 
ab se 

divide the plane into two pairs of opposite sectors; the branches of the hyper- 
bola lie in one pair. As x — oo the branches of the hyperbola get closer and 
closer to these two lines. We call the lines y = +(b/a)x the asymptotes of the 
hyperbola. 
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major axis 


focus focus ra 


directrix directrix 


x=-ale yaale 


In general, 


sec? @ = 1 7 tan? 0. 


Points for which 

t € (—m/2,7/2) 
lie on the right branch of 
the hyperbola, and points 
for which 

t € (w/2, 37/2) 
lie on the left branch of 
the hyperbola. 


renale X= 40 S yay 
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We summarize the above facts as follows. 


Hyperbola in Standard Form A hyperbola in standard form has equation 


2 ag 
x azn where b? = a? (e?—1), d= 0,2 = ll. 


It has foci (+ae,0) and directrices x = -+ta/e; and it can be described by 
the parametric equations 


x=asect, y=btant (¢ € (—7/2,2/2)U @/2,37/2)). 


Problem 9 Let P be a point (sect, + tant), where (t € 
(—m/2, 2/2) U (1/2, 32/2)), on the hyperbola E with equation x? — 
2y =1. 


(a) Determine the foci F and F’ of E. 

(b) Determine the slopes of FP and F’P, when these lines are not 
parallel to the y-axis. 

(c) Determine the point P in the first quadrant on E for which FP is 
perpendicular to F’P. 


Rectangular Hyperbola (e = /2) 

When the eccentricity e of a hyperbola takes the value /2, then e? = 2 and 
b = a. Then the asymptotes of the hyperbola have equations y = =Ex, so 
that in particular they are at right angles. A hyperbola whose asymptotes are at 


right angles is called a rectangular hyperbola. 

Then, if we use the asymptotes as new x- and y-axes (instead of the original 
x- and y-axes), it turns out that the equation of the hyperbola can be written in 
the form xy = c’, for some positive number c. 

The rectangular hyperbola with equation xy = c* has the origin as its centre, 
and the x- and y-axes as its asymptotes. Also, each point on it can be uniquely 
represented by the parametric representation 


c 
x= ch =o where ¢t 4 0. 


We shall use rectangular hyperbolas later on. 


Polar Equation of a Conic 


For many applications it is useful to describe the equation of a non-degenerate 
conic in terms of polar coordinates r and 6. A point P(x, y) in the plane has 
polar coordinates (r, 0) if r is the distance OP (where O is the origin) and 0 is 
the anticlockwise angle between OP and the positive direction of the x-axis. 
Take the origin O to be the focus of the conic, d the directrix, M the foot 
of the perpendicular from a point P on the conic to d, N the foot of the 
perpendicular from O to d, and Q the foot of the perpendicular from P to ON. 


1: Conics 


Another parametric 
representation of this 
hyperbola is 


1+ or 
a y= 
1-12" 1-12 


x= 


te R—{+l}. 


In this problem you will 
find the identity 

sec” 6 = 1 + tan? 
useful. 


ya-x 


directrix 
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Then by the definition of the conic, we have OP = e - PM. We can rewrite 
this as 


r = e(ON — OQ) 
=e-ON-ercos0, 
or 
r(1+ecos0é) =e-ON 
=/, aconstant. 

It follows that the equation of the conic can be expressed in the form 
_ l 

1+ ecosé 
The polar form of the equation of a conic is often used in problems in 


Dynamics: for example, in determining the motion of a planet or of a comet 
round the Sun. 


r 


1.1.4 Focal Distance Properties of Ellipse and Hyperbola 


We now prove two simple but surprising results. We deal with the ellipse first. 


Theorem5 Sum of Focal Distances of Ellipse 

Let E be an ellipse with major axis (—a,a) and foci F and F’. Then, if P 
is a point on the ellipse, FP + PF’ = 2a. In particular, FP + PF’ is constant 
for all points P on the ellipse. 


YA 


ev 


directrix d' directrix d 


Proof Let d and d’ be the directrices of the ellipse that correspond to the foci 
F and F’, respectively. Then, since 


PF =e x (distance from P to d) 


and 
PF’ =e x (distance from P to d’), 


planet 


comet 
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Sun® 
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it follows that 


PF + PF’ =e x (distance between d and d’). 
= 2a, 


which is a constant. |_| 


The result of Theorem 5 can be used to draw an ellipse, using a piece of 
string fixed at both ends. A pencil is used to pull the string taut; then, as we 
move the pencil round, the shape that it traces out is an ellipse whose foci are 
the two ends of the string. 

Notice that, if we are given any three points F, F’ and P (not on the line 
segment F’F) in the plane, then there is only one ellipse through P with F 
and F’ as its foci. Its centre is the midpoint, O, of the segment F’F, its axes 
are the line along F’F and the line through O perpendicular to F’F, and its 
major axis has length PF + PF’. 

Also, if we are given any two points F and F’ in the plane, the locus of 
points P (not on the line segment F’ F) in the plane for which PF + PF’ isa 
constant is necessarily an ellipse. Thus the converse of Theorem 5 holds. 


There is an analogous result for the hyperbola. 


Theorem 6 Difference of Focal Distances of Hyperbola 
Let H be a hyperbola with major axis (—a,a) and foci F and F’. Then, if 
P is a point on the branch of the hyperbola that is closer to F, 


PF — PF = 2a; 
and, if P is a point on the branch of the hyperbola closer to F’, 
PF — PF = —2a. 


In particular, |PF’ — PF| is constant for all points P on the hyperbola. 


YA 


RV 


directrix d' directrix d 
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For, the location of the 
foci and the length of the 
major axis specify an 
ellipse uniquely. 


As a result, some books 
take the ‘Sum of Focal 
Distances Property’ as the 
definition of the ellipse. 


Conic Sections and Conics 21 


Proof We shall prove only the first formula; the proof of the second is similar. 

Let d and d’ be the directrices of the hyperbola that correspond to the foci 
F and F’ respectively, and let P be a point on the branch of the hyperbola that 
is closer to F’. Then, since 


PF =e x (distance from P to d) 
and 
PF’ =e x (distance from P to d’), 
it follows that 


PF’ — PF = e x (distance between d and d’) 


= 2a, 


which is a constant. |_| 


The result of Theorem 6 can be used to draw a hyperbola, this time using a 
piece of string and a stick. Choose two points F and F’ on the x-axis, equidis- 
tant from and on opposite sides of the origin. Hinge one end of a movable 
stick F’X at the focus F’; attach one end of a string of length @ (where @ is 
less than the length of F’X) to the end X of the stick and the other end of the 
string to F, and keep the string taut by holding a pencil tight against the stick, 
as shown. 

Then, as we move the pencil along the stick, the shape that it traces out is 
part of one branch of a hyperbola with foci F and F’. For, 


PF — PF = XF — (XP + PF) 
=XF —¢ 
= aconstant independent of P. 


We obtain the other branch of the hyperbola by interchanging the roles of F 
and F’ in the construction. 


Notice that, if we are given any three points F, F’ and P (not on the line 
through F’ F or its perpendicular bisector) in the plane, then there is only one For, the location of the 
hyperbola through P with F and F’ as its foci. Its centre is the midpoint, O, foci and the length of the 
of the segment F’F, its axes are the line along F’F and the line through O ™4J0F axis specify a 

; , ‘ : ‘ , hyperbola uniquely. 

perpendicular to F’ F, and its major axis has length |PF" — PF. 

Also, if we are given any two points F and F’ in the plane, the locus of 
points P (not on the line segment F’ F) in the plane for which PF’ — PF is a 
non-zero constant is necessarily one branch of a hyperbola. Thus the converse As a result, some books 
of Theorem 6 holds, in the following sense: Given any three points F, F’ and _ take the “Difference of 
P (where P must lie strictly between F and F’ if it lies on the line through Fecal Distances Property’ 
F'F) in the plane for which PF’ — PF 4 0, the locus of points Q in the plane ao oni 
for which OF’ — QF = +|PF’ — PP| is a hyperbola. 
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1.1.5 Dandelin Spheres 


We now give a beautiful proof due to Dandelin of the fact that a slant plane Germinal Pierre Dandelin 
z that cuts one portion of a right circular cone in a ‘complete’ curve E is an (1794-1847) was a 
ellipse, just as it appears to be! French—Belgian Professor 

To do this, first fit a sphere inside the cone so that touches the plane z (at hae eee 1s Dee 

: : : ; . University; he made his 

a point F) and the cone (in a circle C with centre O), as shown in the figure discovery in 1822. 
below. The circle C lies in a horizontal plane which intersects z in a line d. 
Take an arbitrary point P on the curve E, and extend the line from the vertex 
V of the cone through P to meet C at the point L, and let D be the point on d 
such that PD is perpendicular to d. 

The line PD lies in a vertical plane which intersects the vertical plane VLO 
in the line PM, so that APMD and APML are both right-angled triangles. 

Denote by a@ the angle between the slant plane z and the horizontal plane 


through C, and by # the angle “PLM (the base angle of the cone). We use the symbol 2 to 
indicate an angle. 


From the right-angled triangles APMD and APML, we see that 


PM PM 
L=— and PD= —., 
sin 6 sina 
so that . 
PL _ sina 
PD sin’ 


Now PF = PL since they are both tangents from a given point to a given 


sphere; it follows that 
PF sina 
— = —.. (11) 
PD sing 
Now 0 < a < f < 4, since the plane z is less steep than the base angle of 
the cone. So if we let e = sina/ sin f, it follows that 0 < e < 1. 
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It then follows from equation (11) that for any point P on the curve E, 
its distance PF from the fixed point F is e times its distance PD from 
the fixed line d. Since 0 <e <1, it follows from the focus—directrix defini- 
tion of an ellipse that the curve E must be an ellipse, with focus F and 
directrix d. 

If we then construct the other sphere that touches both z and the cone, a 
similar argument shows that the point of contact of the sphere with z is the 
other focus F’ of the ellipse; and the other directrix of the ellipse is the line 
of intersection of z with the horizontal plane through the circle in which the 
sphere touches the cone. 

A similar construction involving spheres proves that in the cases 4 and 
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6 illustrated in the sketch in Subsection 1.1.1 the curve of intersection is a We ask you to look at the 
parabola and a hyperbola, respectively. two cases 4 and 6 in 


Exercise 7 of 


This completes the proof of our claim in Subsection 1.1.1 that the curves of ; 
Subsection 1.5. 


intersection of certain planes with a double cone are an ellipse, a parabola or a 
hyperbola. We shall investigate the converse in Theorem 4 of Subsection 4.1.4. 


1.2 Properties of Conics 


1.2.1. Tangents 


In the previous section you met the parametric equations of the parabola, Subsection 1.1.3 
ellipse and hyperbola in standard form. 

We now tackle a rather natural question: given parametric equations x = eure 
x(t), y = y(t) describing a curve, what is the slope of the tangent to the curve 
at the point with parameter t? This information will enable us to determine the 
equation of the tangent to the curve at that point. tangent 


(x(t), y(t) 


Theorem 1 The slope of the tangent to a curve in R* with parametric 
equations x = x(t), y = y(f) at the point with parameter f is 


y'(t) 
nile). 
provided that x’(r) 4 0. eeeeeee” 


Proof The points on the curve with parameters ¢ and t + h have coordinates — i), yr) 
(x(t), y(t)) and (x(t +h), y(t + h)), respectively. Then, if h 4 0, the slope of y(t+h)=x(0) 
the chord joining these two points is 


yt +h) — yr) 
x(t +h) — x(t)’ 
which we can write in the form 


(y(t +h) — yt))/h 
CE+h)=20))/h 


y(t+h) 
—y(t) 
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We then take the limit of this ratio as h — 0. The slope of the chord tends to 
the slope of the tangent, namely y’(t)/x’ (rt). | 


Example 1 


(a) Determine the equation of the tangent at the point with parameter f to the 
ellipse with parametric equations 


x=acost, y=bsint, 


where ¢t € (—z,7],t 40,7. 

(b) Hence determine the equation of the tangent to the ellipse with parametric 
equations x = 3cos f, y = sin ¢ at the point with parameter f = 7/4. 
Deduce the coordinates of the point of intersection of this tangent with the 
X-axis. 


Solution 


(a) Now, y(t) = b cos t and x(t) = —a sin t fort € (—z, 7]; it follows 
that, for t ~ O or z, the slope of the tangent at the point with parameter 
t is 
y(t) b cos t 
x(t) —asint’ 


Hence the equation of the tangent at the point (a cos f, b sin t), t 4 0, 
IT, 18 
b cos t 


y-bsint=—- (x — acos ft). 


asin t 


Multiplying both sides and rearranging terms, we get 
xbcos t+ ya sin t = ab cos” t +ab sin? t = ab, 
and dividing both sides by ab gives the equation 


x y., 
—cost+—snt=1. (1) 
a b 
The point on the ellipse where t = 0 is (a, 0), at which the tangent has 
equation x = a. Similarly, the point on the ellipse where t = zr is (—a, 0), 
at which the tangent has equation x = —a. It follows that equation (1) 
covers these cases also. 
(b) Here the curve is the ellipse in part (a) in the particular case that a = 3, 
b = 1. When t = 27/4, it follows from equation (1) that the equation of 
the tangent at the point with parameter t = 71/4 is 
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“W 


We shall use this equation 
in Subsection 1.2.2. 
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or 
5x ty=v2. 


Hence, at the point T where the tangent crosses the x-axis, y = 0 and so 
x = 3¥2. Thus, T is the point (3v2, 0). 


Problem 1 Determine the slope of the tangent to the curve in R? with 
parametric equations 


x =2cost+cos2t+ 1, y = 2sin¢ + sin2t 


BV 


at the point with parameter ¢, where ¢ is not a multiple of 2. Hence 
determine the equation of the tangent to this curve at the points with 
parameters f = 2/3 andt = 7/2. 


cardioid 


x=2cost+cos 2r+1 


Problem 2 y=2sin¢+sin 21 


(a) Determine the equation of the tangent at a point P with parameter 
t on the rectangular hyperbola with parametric equations x = f, 
y= I|/t. 

(b) Hence determine the equations of the two tangents to the rectangular 
hyperbola from the point (1, —1). 


We can modify the result of Example 1(a) to find the equation of the tangent 


> 
Xx 


at the point (x1, y,) on the ellipse with equation a + y = 1. We take x = 
acost, y = bsint as parametric equations for the ellipse, and let x} = acos fy 
and yj = bsint,. Then it follows from equation (1) above that the equation of 
the tangent is 


x y, 
— cost; + —sint; = 1, 
a b 


which we can rewrite in the form at + ae = |, 
We can determine the equations of tangents to the hyperbola and the 


parabola in a similar way; the results are given in the following theorem. 


Theorem 2 The equation of the tangent at the point (x1, y1) to a conic in 
standard form is as follows. 


Conic Tangent 
2 2) 

: x y XX] YY 
BIER yay ee 1 
ee ay Seer ae ce 
yperbola 7 _ nee a = ou 


Parabola y* = 4ax yy, = 2a(x + xj) 
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Problem 3 Prove that the equation of the tangent at the point (x1, y1) 
to the rectangular hyperbola xy = 1 is 5 (xy p+xiy=1. 


Problem 4 For each of the following conics, determine the equation 
of the tangent to the conic at the indicated point. 


(a) The unit circle x7 + y? = 1 at (-3, I 3). 
(b) The hyperbola xy = 1 at (-4, -1). 
(c) The parabola y* = x at (1,—1). 


We can deduce a useful fact from the equation xx; + yy, = | for the tangent 
at the point (x1, y;) to the unit circle x? + y* = 1. Let (a,b) be some point on 
this tangent, so that 


ax; + by, = 1. (2) 
Next, let the other tangent to the unit circle through the point (a,b) touch the 
circle at the point (x2, y2); it follows that 


ax2 + byy = 1. (3) 


From equations (2) and (3) we deduce that the points (x1, y1) and (x2, y2) 
both satisfy the equation ax + by = 1. Since this is the equation of a line, it 
must be the equation of the line through the points (x1, y1) and (x2, y2). For 
historical reasons, this line is called the polar of (a,b) with respect to the unit 
circle. 


Theorem3 Let (a, b) be a point outside the unit circle, and let the tangents 
to the circle from (a, b) touch the circle at P; and P). Then the equation of 
the line through P; and P> is 


ax+by=1. 


For example, the polar of (2, 0) with respect to the unit circle is the line 
24 = 1, 


Problem 5 Determine the equation of the polar of the point (2, 3) with 
respect to the unit circle. 


In the next example we meet the idea of the normal to a curve. 


Definition The normal to a curve C at a point P on C is the line through 
P that is perpendicular to the tangent to C at P. 


1: Conics 


This is because (a, b) lies 
on the tangent at (x2, y2), 
whose equation is 

xx2 + yy = 1. 


We shall meet polars of 
other conics in 
Subsection 4.2.1. 


polar of 
(a, b) 


curve 


P 
tangent 


normal 
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Example 2 
(a) Determine the equation of the tangent at the point with parameter f to the 
parabola with parametric equations 


x =atr’, y=2at (tER). 


(b) Hence determine the equations of the tangent and the normal to the 
parabola with parametric equations x = 2t?, y = 4¢ at the point with 
parameter ¢ = 3. 


Solution 


(a) Since y’(t) = 2a and x’(t) = 2at, it follows that, for t 4 0, the slope of 
the tangent at this point is 


y(t) 2a 1 
x(t) 2at it 


Hence the equation of the tangent at the point (at*, 2at), t 4 0, is 
i 2 
y — 2at = 7a ), 


which can be rearranged in the form 
ty=x+ at’. (4) 


The point on the parabola at which t = 0 is (0, 0); there the tangent to the 
parabola is the y-axis, with equation x = 0. It follows that equation (4) 
covers this case also. 

Here the curve is the parabola in part (a) in the particular case that a = 2. 
When ¢ = 3, it follows from equation (4) that the equation of the tangent 
is3y =x+2-3?,or3y=x4+18. 

To find the equation of the normal, we must find its slope and the 
coordinates of the point on the parabola at which t = 3. 

When ¢t = 3, it follows from the equation of the tangent that the slope 
of the tangent is i. Since the tangent and normal are perpendicular to each 
other, it follows that the slope of the normal must be —3. Also, when ¢ = 3, 
we have that x = 2-37 = 18 and y = 4-3 = 12; so the corresponding 
point on the parabola has coordinates (18, 12). 

It follows that the equation of the normal to the parabola at the point 
(18, 12) is 


(b 


we 


y—12=—3(x — 18) 
= —3x + 54, 


or 


y = —3x + 66. 
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We shall use this equation 
in Subsection 1.2.2. 


Recall that lines of 
(non-zero) slope my, and 
my are perpendicular if 
and only if my -m2 = —1. 
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Problem 6 The normal to the parabola with parametric equations 
x =f?, y = 2t (t € R) at the point P with parameter t, t 4 0, meets the 
parabola at a second point Q with parameter T. 


(a) Prove that the slope of the normal to the parabola at P is —t. 

(b) Find the equation of the normal to the parabola at P. 

(c) By substituting the coordinates of Q into your equation from part 
(b), prove that T = -2 —t. 


Problem 7 This question concerns the parabola with parametric 
equations x = at?, y = 2at (t € R). 


(a) Determine the equation of the chord joining the points P; and P2 on 
the parabola with parameters ¢; and f2, respectively, where ft; and fz 
are unequal and non-zero. 


Now assume that the chord P; P2 passes through the focus (a, 0) of the 
parabola. 


(b) Prove that tjf2 = —1. 

(c) Use the result of Example 2(a) to write down the equations of the 
tangents to the parabola at P; and P 2, and to prove that these 
tangents are perpendicular. 

(d) Find the point of intersection P of the two tangents in part (c), and 
verify that it lies on the directrix x = —a of the parabola. 

(e) Find the equation of the normal at the point O(at?, 2at) to the 
parabola. Hence prove that if the normal at Q passes through the 
focus F'(a,0), then Q is the vertex of the parabola. 


1.2.2 Reflections 


We use the reflection properties of mirrors all the time. For example, we look 
in plane mirrors while shaving or combing our hair, and we use electric fires 
with reflecting rear surfaces to throw radiant heat out into a room. 

All reflecting surfaces — mirrors, for example — obey the same Reflection 
Law. The Reflection Law is often expressed in terms of the angles made with 
the normal to the surface rather than the surface itself. However in this section 
we shall state and use it in the following form. 


The Reflection Law The angle that incoming light makes with the tangent 
to a surface is the same as the angle that the reflected light makes with the 
tangent. 
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P(t?, 2t) 


RV 


Q(T’, 2T) 


directrix 


A line is normal to a 
surface in R? if it is 
perpendicular to the 
tangent plane to the 
surface at its point of 
intersection with the 
surface. 


Radio waves or radiant 
heat, etc. obey the same 
Reflection Law. 
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ees (o~ _~0) 


plane mirror curved mirror 


This law applies to all mirrors, no matter whether the reflecting surface is 
plane or curved. Indeed, in many practical applications the mirror is designed 
to have a cross-section that is a conic curve — for example, the Lovell radio- 
telescope at Jodrell Bank in Cheshire, England uses a parabolic reflector to 
focus parallel radio waves from space onto a receiver. 

We now investigate the reflection properties of mirrors in the shape of the 
non-degenerate conics. 


Reflection Property of the Ellipse 
We start with the following interesting property of the ellipse. 


Reflection Property of the Ellipse Light which comes from one focus 
of an elliptical mirror is reflected at the ellipse to pass through the second 
focus. 


In our proof we use the following trigonometric result for triangles. 


Sine Formula Ina triangle AABC with sides a, b, c opposite the vertices 
A, B, C, respectively, 


a _ b _ Cc 
sinZBAC sin ZABC _ sin ZACB’ 


Proof of Reflection Property Let E be the ellipse in standard form, and 
P(acost,bsinf) an arbitrary point on F; for simplicity, we shall assume that 
P lies in the first quadrant. 

Then, as we saw earlier, 


PF = e x (distance from P to corresponding directrix d) 


a 
=eC xX (= — acoss) =a-—aecost, 


e 
and 
PF’ =e x (distance from P to d’) 
=ex (= + acost) =a-+aecost. 
Hence, 


PF a —aecost 1 —ecost 


PF a+aecost 1+ecost 
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Lovell radio-telescope. 


= 
IX 


Subsection 1.1.3 
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Next, we saw earlier that the equation of the tangent at P to the ellipse is Example 1(a) 


Pee P (a cos t, b sin t) 
T 

> 

x 


x y, 
—cost++sint = 1; 
a b 


hence at the point T where the tangent at P intersects the x-axis, we have 


x 
—cost=1, or x =a/cost. 
a 


It follows that 
TF (a/cost)—ae 1-—ecost 
TF (a/cost)+ae  1+ecost’ 


We deduce that 
PF TF Pr PF 


PP TF’ ° TF TF’ 
By applying the Sine Formula to the triangles APFT and APF’T, we obtain 
that 


PF  sinZPTF F PF’ sinZPTF’ 
— = —— and — = —— , 
TF sinZTPF TF sin ZTPF’ 


so that 
sinZPTF _ sin ZPTF’ 


sinZTPF sin ZTPF” 

Since ZPTF = ZPTF' it follows that sin ZTPF = sin ZTPF’, so that B a 
ZIPF = 1 — ZTPF' since ZTPF # ZTPF’. Hence TPF equals the angle 
denoted by the symbol a in the diagram, and this is equal to the angle 6 (as a 
and 6 are vertically opposite). 

This completes the proof of the Reflection Property. | 


y 
y 


An amusing illustration of the property is as follows. A poor snooker player 
could appear to be a ‘crack shot’ if he used a snooker table in the shape of an 
ellipse: for if he places his snooker ball on the table at one focus and a target 
ball at the other focus, then no matter what direction he hits his ball, he is 
certain to reach his target! 


Reflection Property of the Hyperbola 


The hyperbola has a reflection property similar to that of the ellipse, with an 
appropriate modification. 


Properties of Conics 


Reflection Property of the Hyperbola Light coming from one focus of 
a hyperbolic mirror is reflected at the hyperbola in such a way that the light 
appears to have come from the other focus. 

Also, light going towards one focus of a hyperbolic mirror is reflected at 
the mirror towards the other focus. 


We omit a proof of this result, as it is similar to the proof of the Reflection 
Property of the ellipse. 


Reflection Property of the Parabola 


The Reflection Property of the parabola is also similar to the reflection property 
of the ellipse. 


Reflection Property of the Parabola Incoming light parallel to the axis 

of a parabolic mirror is reflected at the parabola to pass through the focus. 
Conversely, light coming from the focus of a parabolic mirror is reflected 

at the parabola to give a beam of light parallel to the axis of the parabola. 


Proof Let E be the parabola in standard form, and let P(at?,2at) be an 
arbitrary point on E. 

We have seen that the equation of the tangent at P to the parabola has equa- 
tion ty = x + at’. If T is the point where this tangent meets the x-axis, then at 
T we have y = Oandt-0=x + at”, so that x = —a??. 

In the triangle APTF we have 


TF =TO+ OF =at+a 


and, by the Distance Formula, 


FP= V(a — at”)? + (at)? = Vva2 + 2a2t2 + a2t4 
=at at’. 


Then, since TF = FP, the triangle APTF is isosceles, and so ZTPF = ZFTP. 

Now since the horizontal line through P is parallel to the x-axis, the angle 
between the tangent at P and the horizontal line through P is equal to ZFTP 
(as they are corresponding angles), and so also to ZTPF. This completes the 
proof of the reflection property. | 


The reflection property of the parabola is also the principle behind the design 
of searchlights as well as radio-telescopes. The reflector of a searchlight is a 
parabolic mirror, with the bulb at its focus. Light from the bulb hits the mirror 
and is reflected outwards as a parallel beam (see p. 32). 

The design of optical telescopes sometimes uses the Reflection Properties 
of other conics too. For example, the 4.2 metre William Herschel telescope 
at the Roque de los Muchachos Observatory on the island of La Palma in the 
Canary Islands, has an arrangement of mirrors known as a Cassegrain focus: 
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This is called the Internal 


Reflection Property. 
This is called the External 
Reflection Property. 
y 
F % 
Example 2 


T (-at’, 0) 
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a primary parabolic mirror reflects light towards a secondary hyperbolic mirror, 
which reflects it again to a focus behind the primary mirror. 


searchlight radio-telescope 


secondary mirror 


1 

1 

’ : \ 

(hyperbolic) primary mirror 1 
\ 


(parabolic) 


Cassegrain focus 


The secondary mirror is used to focus the light to a much more convenient 
place than the focus of the primary mirror, and to increase the effective focal 
length of the telescope (and so its resolution). 

We can summarize the above three Reflection Properties concisely as fol- 
lows. All mirrors in the shape of a non-degenerate conic reflect light coming 
from or going to one focus towards the other focus. 


Problem 8 Let £ and ZH be an ellipse and a hyperbola, both having 
the same points F and F’ as their foci. Use the reflection properties of 
the ellipse and hyperbola to prove that at each point of intersection, E 
and H meet at right angles. 


1.2.3 Conics as envelopes of tangent families 


We now show how we can construct the non-degenerate conics as the envelope 
of a family of lines that are tangents to the conics. In other words, the conic 
being constructed is the curve in the plane that has each of the lines in the 
family as a tangent. 

The method depends on the use of a circle associated with each non- 
degenerate conic, called its auxiliary circle. The auxiliary circle of an ellipse 


1: Conics 


In the case of the 
parabola, we regard the 
second focus as ‘lying at 
infinity’. 


Such methods are often 
used in exhibitions to 
display the shapes of the 
conics in a visually 
appealing way, using 
coloured threads or string. 


Properties of Conics 


or hyperbola is the circle whose diameter is its major axis; analogously we 
shall define the tangent to a parabola at its vertex to be the auxiliary circle of 
the parabola. 


4 A 
: > > 
vertex axis A A 
major axis 
parabola ellipse auxiliary 
directrix circle 
= auxiliary 
circle 
A 
ae =e 
major | axis 
hyperbola hyperbola 


auxiliary circle 
The mathematical tool that we use in our construction is the following result. 


Theorem 4 A perpendicular from a focus of a non-degenerate conic to a 
tangent meets the tangent on the auxiliary circle of the conic. 


Proof (for a parabola) Let the point P(at?,2at) lie on the parabola in stan- 
dard form with equation y? = 4ax, and let the perpendicular from the focus 
F(a,0) to the tangent at P meet it at T. 

By Theorem 2 of Subsection 1.2.1, the tangent at P has equation 


y - 2at = 2a(x + at’), 


which we may rewrite in the form 


1 
y= -x + at. 
t 


(5) 


From this we see that the slope of the tangent PT is 1/t, so that the slope of the 
perpendicular FT must be —t. Since FT also passes through F(a, 0), FT must 
have equation 

ytux=0+t-a 
which we may rewrite in the form 


y = —tx + at. (6) 
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This definition for the 
parabola enables us to 
give succinct statements 
of properties for all 
conics. 


Here by ‘non-degenerate 
conic’ we mean a 
parabola, a (non-circular) 
ellipse or a hyperbola. 


Analytic proofs for an 
ellipse or hyperbola are 
similar; however for these 
conics there are two 
perpendiculars, both of 
which meet the tangent on 
the auxiliary circle. 
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The equations (5) for PT and (6) for FT clearly have the solution x = 0, 


y = at. This means that the point of intersection T of the lines PT and 
FT has coordinates (0, at). Hence T lies on the directrix of the parabola, as 
required. a 
Remark 


Given a parabola and its axis, we can use Theorem 4 to identify the focus of 
the parabola. We draw the tangent at any point P on the parabola, and then 
the perpendicular to the tangent at the point T where the tangent meets the 
directrix. This perpendicular crosses the parabola’s axis at its focus. 


Problem 9 Prove Theorem 4 for an ellipse. 


To construct the envelopes of the conics, you will need a sheet of paper, a 
pair of compasses, a set square and a pin. 


Parabola 


Draw a line d for the directrix of the parabola and a point F (not on d) for 
its focus. Place a set square so that its right-angled vertex lies at a point of d 
and one of its adjacent sides passes through F’;; draw the line @ along the other 
adjacent side of the set square. By Theorem 4, ¢ is a tangent to the parabola 
with focus F and directrix d. 

Repeating the process with the vertex of the set square at different points of 
d gives a family of lines @ that is the envelope of tangents to the parabola, as 
shown below. 


Ellipse 


Draw a circle C for the auxiliary circle of the ellipse and a point F inside C 
(but not at its centre) for a focus. Place a set square so that its right-angled 
vertex lies at a point of C and one of its adjacent sides passes through F’; draw 


1: Conics 


The directrix is simply the 
tangent where the axis 
cuts the parabola. 


The family of lines 
forming the envelope is 
{€: vertex of set square 
€ d}. 
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the line € along the other adjacent side of the set square. By Theorem 4, ¢ is a 
tangent to the ellipse with focus F and auxiliary circle C. 
Repeating the process with the vertex of the set square at different points The family of lines 


of C gives a family of lines that is the envelope of tangents to the ellipse, as forming the envelope is 
{€: vertex of set square 


EC}. 


shown below. 


auxiliary 
circle 


Hyperbola 


Draw a circle C for the auxiliary circle of the hyperbola and a point F outside 
C for a focus. Place a set square so that its right-angled vertex lies at a point 
of C and one of its adjacent sides passes through F; draw the line ¢ along 
the other adjacent side of the set square. By Theorem 4, ¢@ is a tangent to the 
hyperbola with focus F and auxiliary circle C. 
Repeating the process with the vertex of the set square at different points of 
C gives a family of lines that is the envelope of tangents to one branch of the The family of lines 
hyperbola, as shown below. forming the envelope is 
Repeating the construction with the other focus F’ (diametrically opposite {@ vertex of set square 
F with respect to C) gives the other branch of the hyperbola. mel 
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1.3. Recognizing Conics 


So far, we have considered the equation of a conic largely when it is in ‘stan- 
dard form’; that is, when the centre of the conic (if it has a centre) is at the 
origin, and the axes of the conic are parallel to the x- and y-axes. However, 
most of the conics which arise in calculations are not in standard form; thus 
we need some way of determining from the equation of a conic which type of 
conic it describes. 

First we observe that all the equations of all (non-degenerate) conics in 
standard form can be expressed in the form 


Ax’ + Bry + Cy’ + Fx+Gy+H =0, (1) 
where not all of A, B and C are zero. For example, the equation of the circle 
x? + y? + 4x + 6y — 23 =0 (2) 
is of the form (1), with A= C = 1, B =0, F = 4, G = 6and A = —23. 


Now we can obtain any non-degenerate conic from a conic in standard form 
by a suitable rotation 


(x,y)  (xcosé — ysiné,x sind + ycosé@) 
followed by a suitable translation 


(x,y) (x —a,y —b). 


Both of these transformations are linear, so that the equation of the conic at 
each stage is a second degree equation of the type (1); in other words, any 
non-degenerate conic has an equation of type (1). 

The equations of degenerate conics can also be expressed in the form (1). 
For example, 


x7+y=0 represents the single point (0, 0); 

y? — 2xy+x?=0 represents the single line y = x, since 
y= 2g HV =a 

y? —x7 =0 represents the pair of lines y = +x, since 


y? —x? =(y+x)(y— x). 


However, an equation of the form (1) can also describe the empty set; an 
example of this is the equation x? + y* + 1 = 0, as there are no points (x, y) 
in R* for which x? + y* = —1. For simplicity in the statement of the theo- 
rem below, therefore, we add the empty set to our existing list of degenerate 
conics. 


1: Conics 


The equations we have 
met were: 


(x —a)* + (y — by? =r’, 


y" = 4ax, 

2 2 
x y 
ae aes 2 
a2 2 
x2 y? - 
a bp 


This rotates the axes 
through an anticlockwise 
angle @ to align them with 
the axes of the conic. 


This moves the centre or 
vertex of the conic to the 
origin. 


This is an unexpected 
possibility! 


Recognizing Conics 
In the above discussion, we proved one part of the following result. 


Theorem 1 Any conic has an equation of the form 


Ax’ + Bry + Cy’ + Fx+Gy+H =0, (3) 


where A, B, C, F, G and H are real numbers, and not all of A, B and C are 
zero. Conversely, any set of points in R* whose coordinates (x, y) satisfy 
equation (3) is a conic. 


In this section we investigate the classification of conics in terms of equa- 
tion (3). In particular, if we are given the equation of a non-degenerate conic 
in the form (3) how can we determine whether it is a parabola, an ellipse or a 
hyperbola? And how can we identify its vertex or centre? And its axis, or its 
major and minor axes? A key tool in this work is the matrix representation of 
the equation of a conic. 


Introducing Matrices 


We can express a general second degree equation in x and y 


Ax? + Bxy + Cy” + Fx+ Gy+ H =0, (4) 
where A, B and C are not all zero, in terms of matrices as follows. 
A 5B F x 
Let A = dal ) ana x = ( ) Then 
5B C G ? 
A iB 
x? Ax = (x y) : (* ) 
5B C y 
1 1 x 
= (Ar+ By xBx+ cy) ( ) 
J 
= Ax* + Bry + Cy* 
and 
Jx=(F G) (*) 
y 
= Fx+ Gy. 
We may therefore write the equation (4) in the form 
x’ Ax+ J’x+H =0. (5) 


For example, let E be the conic with equation 
3x? — 10xy + 3y? + 14x —2y+3=0. 
The equation of E is of the form (4) with A = 3, B = —10, C = 3, F = 14, 


G = —2 and H = 3. It follows from the above discussion that we can express 
the equation of E in matrix form as xPAx+ J'x+H = 0, where 


3-5 14 x 
Ge ae 7 (5). H=3 and al.) 
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We omit a proof of the 
converse part. It would 
simply be a reworking of 
the classification methods 
in the rest of the section. 


This will be useful, since 
we can then use the whole 
armoury of Linear 
Algebra to study such 
equations. 


Here we choose to regard 
1 x 1 matrices and real 
numbers as equivalent; 
this will cause no 
problems. 


This is called the matrix 
form of the equation (4). 
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Problem 1 Write the equation of each of the following conics in 
matrix form. 


(a) 11x? + 4xy + 14y* — 4x — 28y — 16 =0 
(b) x? — 4xy + 4y? — 6x —8y +5=0 


A key tool in our use of matrices will be the following result. 


Theorem 2 A 2 x 2 matrix P represents a rotation of R* about the origin 
if and only if it satisfies the following two conditions: 


(a) P is orthogonal; 
(b) det P= 1. 


Proof A matrix P represents a rotation about the origin (anticlockwise 
through an angle 6) if and only it is of the form 


cos@ —sin@ 
; 6 
(ee: cos @ ) (6) 
It is easy to verify that P satisfies conditions (a) and (b). 
b 
Next, let P = (‘ ;) be a matrix that satisfies conditions (a) and (b). 
c 
Then, since P is orthogonal, the vector (“) has length 1; that is, 
é 


a* +c? = 1. Thus there is a number @ for which 


a=cosd and c=sinéd. 


Also, since P is orthogonal, the vectors a pais and 2 are 
c sin @ d 


orthogonal; that is, (cos@ sin @) (3) =0Oor 
cosd-b+sind-d=0. 
So there exists some number A, say, such that 
b=-—AsinOd and d=dAcosé. 
Then since det P = 1, we have 
1 = ad — be =Acos*@ + Asin’ 9, 


so that A = 1. It follows that P must be of the form (6), and so represent a 
rotation of R* about the origin. o 


1: Conics 


If P is orthogonal, then 
det P = +1; when 

det P = —1, P represents 
reflection in the x-axis 
followed by a rotation. 


Recognizing Conics 


Using Matrices 


We now use the methods of Linear Algebra to recognize conics specified by 
their equations. 


Example 1 Prove that the conic E with equation 
3x? — 10xy + 3y? + 14x —2y+3=0 


is a hyperbola. Determine its centre, and its major and minor axes. 


Solution We saw above that the equation of E can be written in matrix form 
as x? Ax + J’x + H =0, where 


3. -5 14 x 
t-(3 9. 1-(S)e west 2 (0) 


that is, as 


3-5 x x 
(x (3 3) +a -2)(*)+3=0. 


We start by diagonalizing the matrix A. Its characteristic equation is 


3-A -5 
0 = det(A = 2) = _5 ay 
=)* — 6A — 16 
= (A— 8) +2), 
so that the eigenvalues of A are A = 8 and A = —2. The eigenvector equations 


of A are 
(3 —A)x —Sy =0, 
—5x+3-A)y =0. 
When A = 8, these equations both become 


—5x —S5y =0, 


é , 1 F 
so that we may take as a corresponding eigenvector ( a which we 


1/V2 
normalize to have unit length as ( _ ne ‘ 


When A = —2, the eigenvector equations of A become 
5x — 5y = 0, 
—5x+5y =0, 


oe 1 ‘ : 
so that we may take as a corresponding eigenvector i} which we normalize 


to have unit length as ( i vA) ; 
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40 
Now 
v2 1/2 tad 4 
-1/V/2 1//2} 2 2 ” 
so we take as our rotation of the plane the transformation x = Px’ where 
1/V2 1/V2 
P= ( ca ae . The transformation x = Px changes the equation of 


the conic to the form 
(Px’)’ A(Px’) + J’ (Px’) + H =0 
or 


(x’)? (PT AP)x’ + (J’P)x’ + H =0. 


Since P? AP = ls 5 fit equation of the conic is now 
8 0)\/x' 
© (9 2)G) 
iid 25) Ne Nae? (<))+3=0 
—1f//2 1/2} \y’ > 


which we can rewrite in the form 
8x’? — 2y? + 82x + 6V2y' +3 =0. 
We may rewrite this equation in the form 
8 (s* 4 V2x') ~2 (v? - 3v2y') seg 
so that, on completing the square, we have 
8 (x'+ v2) —4~2(y’ —3/v2) +9+3=0, 
which we can rewrite in the form 
8 (x' es 1/v2)_ 29 (y' = 3/2) = -8, 


or 


(vn (e1na) 
4 1 = 
This is the equation of a hyperbola. 
From equation (7) it follows that the centre of the hyperbola E is the point 
where x’ = —1/,/2 and y’ = 3/./2. From the equation x = Px’, it follows 
that in terms of the original coordinate system this is the point 


(*)- 1/V2 1/V2\ (-1/V2 
¥J 2/2 1/2] \ 3/72 


-(). 


(7) 


that is, the point (1, 2). 


1: Conics 


Note that we need to 
check the order in which 
the normalised 
eigenvectors appear in P 
to ensure that P represents 
a rotation. 


Since P is orthogonal and 
det P = 1, P represents a 
rotation — in fact, an 
anticlockwise rotation 
through —7zr/4; that is, a 
clockwise rotation 
through 7/4. 


For 
x? 4 6/2! 

2 
= (yi)! 
=('+%) -2 

and 
y* —3vV2y’ 


We can write this equation 
in (nearly) standard 
form as 

(y")? & 


4 1 


NY\2 
a7 


? 


where x” = x’ + —L and 


V2 


Recognizing Conics 


It also follows from equation (7) that the major axis of E has equation 
x’ +1//2=0, or x’ = —1/./2; and the minor axis of E has equation 
y’ —3/V2 = 0, or y’ = 3/V2. 

Finally, since the matrix P is orthogonal we can rewrite the equation x = Px’ 
in the form x’ = P~!x = P’ x, so that 


or as a pair of equations 


It follows that the equation, x’ = —1/ /2, of the major axis of the hyperbola 
E can be expressed in terms of the original coordinate system as 


1 


BP B= -y 


/2? 


or x-—y=-l. 
Similarly, the equation, y’ = 3//2, of the minor axis of the hyperbola can 
be expressed in terms of the original coordinate system as 


3 
V2’ 


1 = 
at + or x+y=3. 


1 — 
ae 


The above problem illustrates a general strategy for identifying conics from 
their second degree equations. 


Strategy To classify a conic E with equation 
Ax? + Buy + Cy? + Fx + Gy +H =(03 


1. Write the equation of E in matrix form x’ Ax + J’x+ H = 0. 

2. Determine an orthogonal matrix P, with determinant 1, that diagonal- 
izes A. 

3. Make the change of coordinate system x = Px’. The equation of E then 
becomes of the form 


Mie == iy ef x ey eh — 0, 


where A; and A are the eigenvalues of A. 

4. ‘Complete the squares’, if necessary, to rewrite the equation of E in 
terms of an (x”, y”)-coordinate system as the equation of a conic in 
standard form. 

5. Use the equation x’ = P’x to determine the centre and axes of E in 
terms of the original coordinate system. 
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Recall that the equation 

2 2 

2 - iz = | represents a 
hyperbola with major axis 
y = 0 and minor axis 
x=0. 


okt 


aE aah 
Recall that x” = x’ + a 


/ 


and y” = y’ — Won 


Reorder the columns of P 
if necessary to ensure that 
det P = | rather than —1. 


This is a rotation of R2. 


Here 4; corresponds to 
the first column in P, and 
A2 to the second column 
in P. 


This is a translation of R2. 


42 


Problem 2 Classify the conics in R? with the following equations. 
Determine the centre of those that have a centre. 


(a) 11x? + 4xy + 14y? — 4x — 28y — 16=0 
(b) x? — 4xy + 4y? — 6x —8y +5=0 


In fact, using the above strategy we can prove the following result. 


Theorem3 A non-degenerate conic with equation 
Ax? + Buy + Cy? + Fx + Gy +H = 
1 
is 7B 
7B OC 
(a) If det A < 0, E is a hyperbola. 


(b) If det A = 0, E is a parabola. 
(c) If det A > 0, F is an ellipse. 


and matrix A = ( ) can be classified as follows: 


Problem 3 Use Theorem 3 to classify the non-degenerate conics in 
IR? with the following equations. 


(a) 3x? — 8xy+ 2y? —2x + 4y— 16=0 
(b) x? + 8xy + l6y* —x + 8y—12=0 
(c) 52x” — 72xy + 73y? — 32x — 74y + 28 =0 


1.4  Quadric Surfaces 


1.4.1 Quadric Surfaces in R? 


Quadric surfaces (or quadrics) are surfaces in R? that are the natural analogues 
of those curves in R? that we call conics. 


Definition A quadric surface in R? is a set given by an equation of the 
form 


Ax’ + By? + C2 + Fxy + Gyz+ Hxz+Jx+Ky+iz+M=0, (1) 


where A, B, C, F, G, H, J, K, L and M are real numbers, and not all of 
A, B, C, F, G and H are zero. 


We use the term degenerate quadrics to describe those quadrics that are the 
empty set, a single point, a single line, a single plane, a pair of planes and a 
cylinder. 


1: Conics 


We omit a proof of this 
result. 


Since det A = AC — 

1p? = —1(B? — 4A0). 
Theorem 3 is often 
referred to as ‘the 

B2 — 4AC test’ for conics. 


Note that det A and 
B? — 4AC have opposite 
signs. 


You may assume that 
these conics are 
non-degenerate. 


Quadric Surfaces 


By a cylinder we mean a surface that consists of an ellipse, parabola or 
hyperbola in some plane zr, together with all the lines in R* through that conic 


that are normal to zr. 


! hyperbolic cylinder parabolic cylinder 
elliptic cylinder 


This leaves six different types of quadric surface, the so-called non- 
degenerate quadric surfaces. We illustrate these below, with a typical equation 
for each. In each case, we state also the curve of intersection of a plane parallel 
to a coordinate plane that meets the surface in a non-trivial intersection. 


ellipsoid hyperboloid of 1 sheet 


2 
x* 
a2 b2 2 


hyperbola or ellipse 


ellipse or hyperbola ellipse or hyperbola hyperbola or parabola 


As well as being attractive visually, quadrics arise naturally in various areas 


of applied mathematics. 
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1.4.2 


We use the notion of orthogonal diagonalization of 3 x 3 matrices to classify 
non-degenerate quadrics, just as we used 2 x 2 matrices to classify conics. 


Recognizing Quadric Surfaces 


Introducing Matrices 


Consider a quadric surface with equation 


Ax? + By’ + C2 + Fry + Gyz + Hxz + Jx + Ky +Lz+M =0, 


and let 
A 35F 35H ; 7 
A= 5F B 5G , J=|[K and x=|y 
lq 16 ¢ - 


Then a calculation similar to that for conics shows that 
x Ax = Ax’ + By? + C2? + Fxy + Gyz + Axz 
and 
J'x = Jx + Ky + Lz. 
We may therefore write the equation of the quadric surface in the form 
x’ Ax+ J’x+M=0. (2) 
For example, the equation of the quadric surface given by 
5x7 + 3y* + 3z” — Ixy + 2yz — 2xz — 10x + 6y — 2z-10=0 


may be written in matrix form x’ Ax + J’x + M = 0 where 


5 -l1 -l —10 x 
A=|-l 3 1], J= 6), x= and 
—1 1 3 —2 Zz 

M=-—_10. 


Problem 1 Write the equation of the following quadrics in matrix 


form. 


9x? Sy? = 2? tay 3yz — 2xz — 2x —6y+ 10z -—12=0 


YO YE XZ 
A key tool for classifying quadrics is the following result about matrices. 


Theorem 1 A 3 x 3 matrix P represents a rotation of R* about the origin 
if and only if it satisfies the following two conditions: 


(a) P is orthogonal; 
(b) det P= 1. 


1: Conics 


Section 1.3 


This is called the matrix 
form of the equation of the 
quadric surface. 


We omit a proof, as it is 
similar to that of Theorem 
2 in Section 1.3. 


If P is orthogonal and 
det P = —1, then P 
represents a rotation about 
the origin composed with 
a reflection in a plane 
through the origin. 


Quadric Surfaces 


Using Matrices 


Our approach to classifying quadrics in R? using matrices is broadly similar to 
that for classifying conics in R?. 


Strategy To classify a quadric E with equation 
Ax’ + By? + C2 + Fry + Gyz + Hxz + Jn + Ky+1z+ M =0: 


1. Write the equation of E in matrix form x’ Ax + J’x+ M =0. 

2. Determine an orthogonal matrix P, with determinant 1, that diagonal- 
izes A. 

3. Make the change of coordinate system x = Px’. The equation of E then 
becomes of the form 


Aix”? + Agy? + A3z” + jx’ + ky + le +m =0, 


where 41, A2 and A3 are the eigenvalues of A. 

4. ‘Complete the squares’, if necessary, to rewrite the equation of E in 
terms of an (x”, y”, z’)-coordinate system as the equation of a quadric 
in standard form. 

5. Use the equation x’ = P’x to determine the centre and planes of 
symmetry of F in terms of the original coordinate system. 


Example 1 Prove that the quadric E with equation 
5x? + 3y? + 3z* — xy + 2yz — 2xz — 10x + 6y — 2z- 10 =0 
is an ellipsoid. Determine its centre. 


Solution We saw above that the equation of E can be written in matrix form 
as x! Ax + J’x + M = 0, where 


5 -l -l —10 x 
A=]-l 3 1}, J= 6], x=I|y and 
—l 1 3 —2 Zz 
M =-10; 
that is , as 
S =L =] x x 
(x y z)[-1 3 1]]y]+(-10 6 -2){y]-10=0. 
-1 1 3 Z Z 
We start by diagonalizing the matrix A. Its characteristic equation is 
S5-r -=l1 -1 
O=det(A-—AD=| -1 3-A 1 


| — 
= —A? +1122 — 362 + 36 
= —-(A— 2)(A — 3)(A — 6), 
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Reorder the columns of P 
if necessary to ensure that 
det P = | rather than —1. 


This is a rotation of R3. 


Here 4; corresponds to the 
ith column in P. 


This is a translation of R?. 


Just before Problem 1 
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so that the eigenvalues of A are A = 2, 3 and 6. The eigenvector equations of 
A are 


(5—A)x —-y—-—z=0, 
—x+ (3-A)y+z=0, 
—x+y+(3-A)z=0. 


When 4 = 2, these equations become 


3x —y—z=0, 
—x+ty+z=0, 
—x+y+z=0. 


Adding the first two equations we get x = 0; it then follows from all the 
equations that y + z = 0. So we may take as a corresponding eigenvector 


0 0 
1 |, which we normalize to have unit length as | 1//2 
=I —1/V2 


Similarly, when 2 = 3, we may take as a corresponding eigenvector | | ], 


1/V3 
which we normalize to have unit length as | 1/ J3 ; and when A = 6, we 
1/3 
2 
may take as a corresponding eigenvector {| —1 ], which we normalize to have 
—1 
2//6 
unit length as | —1//6 
-1//6 
Now 
oO LL & 0 LL & 
V3 v6 V3 6 
1 1 -1 1 1 -1 
v2 v3 vo] | v2 v3 V6 
al. gh. jel 0 +4 = 
v2 v3 V6 V3 v6 
tt 2 
— 1 | v3 v6 
Se | 2. a2 
“lA OT 
elle fin 28-2), 
=—J5(-Jq) = +1 
so we take as a convenient rotation of R? the transformation x = Px’, 
0 1/V3 2/6 
where P = 1 2 1 [V3 —1 /V6 . This transformation changes the 


—1//2 1/V3 -1/76 


equation of the quadric to the form 
(Px’)? A(Px’) + J’ (Px’) + M =0 
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We omit the details. 


Adding row 2 to row 3 


Expanding in terms of the 
first column of the 
determinant 


P represents a rotation of 
R? since it is orthogonal 
and det P = 1. 
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or 
(x!) (PTAP)x’ + J" P)x’ + M = 0. 
2 0 0 
Since P'AP= | 0 3. 0 J, this is the equation 
0 0 6 
2 0 0 : 
(x y z)1/0 3 0 y’ 
0 0 6 zi 
1 2 
o 4% #)\/e 
+(-10 6 -Y/ am sz y | -10=0, 
-1 1 =1 : 
22 v3 V6 


which we can rewrite in the form 
2x! + 3y? + 67? 4+ 4/2x! — 2/3y' — 4/62’ — 10 = 0. 
‘Completing the square’ in this equation, we get 


7) + 22x’) +3 (»? _ 3y) +6(z? - 42) -—10=0 


so that 
2(x'+ v2) —443(y’— 4) —146(z/- 2)" 4 10 =0 
: ‘ V3 V6 ee 
We now make the transformation This is a translation of R?. 
sf 
x’ = x’ fe Fz ; (3) 

=2 
974 


so that we can rewrite the equation of EF in the form 
2x")? 4 a(y"* 4 6(2")" = 19, 


In the general form of the 


7 (x")? (y”)? (2)? equation of an ellipsoid in 
19/2 19/3 19/6 =1. (4) Subsection tt E has 
It follows from equation (4) that E must be an ellipsoid. naa ee ao =i 
From equations (3) and (4) it follows that the centre of the ellipsoid E is the ane = t78: 
point where x’ = —J/2, y’ = a and z’ = =. From the equation x = Px’, it 
follows that in terms of the original coordinate system this is the point 
1 a sal 


x 
y = 
z 


SILSE © 
SI- SES 
5 
5 


that is, the point (1, —1, 1). 
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Problem 2 Prove that the quadric surface E with equation 
VY YS XZ 


is a hyperbolic paraboloid. Determine its centre. 


1.4.3 Rulings of Quadric Surfaces 


We now turn our attention to two of the quadric surfaces, the hyperboloid of 
one sheet and the hyperbolic paraboloid. Each of these can be very beautifully 
constructed entirely from a family of straight lines. 


Definition A ruled surface in R? is a surface that can be made up from a 
family of straight lines. 


The Hyperboloid of One Sheet 
Firstly, we look at the hyperboloid of one sheet E with equation 


e+y—-2eal, 


illustrated below. 


The surface meets each horizontal plane in a circle whose centre lies on 
the z-axis; for example, the circles C; and C_, drawn in the figure, where 
the surface meets the planes z = 1 and z = —1, respectively; both of these 
circles have radius 2. The surface meets each plane containing the z-axis in 
a rectangular hyperbola. The surface appears rather like a cooling tower at a 
power station. 
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That is, each plane 
parallel to the 
(x, y)-plane. 


Quadric Surfaces 


Let £ be the line through the points (v2, 0, 1) on C, and (0, J/2, -1) on 
C_,. Any point on £ has coordinates 


i (v2.0, 1) 4 =H) (0. V2,-1) = (av2, (1 =2)9/2,21.= 1) (eR). 
(5) 


Clearly each point with coordinates given by equation (5) lies on the surface 
E, since 


(av2) (a = a2). — (24 — 1)? 
a7 4 tH 9  =  H S 
=1. 


In other words, the point lies on E for any choice of the parameter A; so the 
whole of the line @ lies in the surface E. 

We now use the fact that the surface is symmetric about the z-axis; in other 
words, a rotation about the z-axis carries the surface to itself. Our line & meets 
each horizontal circle in E in a single point; and, if we rotate the surface about 
the z-axis, 2 is moved to a new line, ¢’ say, which also lies in the surface, and 
which does not meet £. 

We can see that ¢’ does not meet £ because ¢’ meets each horizontal circle 
in a point different to that in which £ meets that horizontal circle — namely, in 
the point obtained by the rotation of the intersection point with ¢. 

So we say that the hyperboloid of one sheet E is generated by the straight 
line @ and the rotations of ¢ described above. These straight lines are called a 
family of generators (or generating lines), Z@ say, of E, and E is called a ruled 
surface. 

In fact, E possesses another family of generators too. 


Problem 3 Verify that the line m through the points (/2,0, 1) and 

(0, —/2, —1) lies entirely in the quadric surface E with equation x7 + 
ye?) 

yoo =1. 


There is thus a second family, .W@, say, of lines that are also generators of 
the surface E, and this is obtained by rotating the line m about the z-axis (as 
shown dotted in the diagram in the margin). 

From the construction of the families & and ./, it is clear that any two 
distinct lines in a given family do not meet. However, each line in “& meets 
each line in .@ (— with one exception, as we shall explain below). 

To prove this claim, it is sufficient (in view of the rotational symmetry of 
the surface E) to verify that the given line £ in @ meets each line in .#. Now, 
recall (from equation (5) above) that each point of @ has coordinates 


(av2, (IK? Dim we (6) 
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for some 4 € R. Ina similar way as you saw in your solution to Problem 3, a 
typical point of the line m in .@ has coordinates 


mn (v2.0, 1) 4 =p) (0, -v2,-1) 
= (uv2,(u— v2.2" - 1), (7) 


for some jz € R. We can then find the coordinates of points on any other line 
m’ in the family ./@ by rotating m about the z-axis through a suitable angle 0; 
that is, by the transformation 


(x,y,z) > (xcos@ — ysind,x sin@ + ycosé,z), Here we have 
which sends the point on m with coordinates (7) to some point x= pv2, 
(v2 cos@ — wsiné + sing), y= ("—1)¥2, and 
z=2y-1. 
/2(u sind + ucos 6 — cos6), 2 — 1) (8) 


ona line m' in.@. 

To find the point(s) of intersection of the lines £ and m’, we have to find the 
values of A and yz for which the points (6) and (8) are equal. 

By comparing the third coordinates in (6) and (8), we see that we must have 


A. = pw. Then, by comparing the first coordinates, we find Here we have used the 


fact that 7A = pw. 
V2 = JV2(A.cos@ — Asin@ + sind), 


so that . 

7 sin @ (9) 

~ 1+sin@ —cos6° 

Finally, by comparing the second coordinates in (6) and (8), we find Here we have again used 
the fact that 7 = py. 


Xr 


(1 —A)V2 = V2(A sin@ + Acos6 — cos), 


so that 
1+ cos0 
4 = ——__—_.. (10) 
1+sin0é + cos@é 


The expressions for A in (9) and (10) are equal, since 


sin 9 1+ cos0é 
1+ sin@ — cosé la sin@ + cos @ 
sin@(1 + sind + cos @) 
~ (+ sin@ — cos6)(1 + cos) 


sin6@ + sin? 6 + sin@ cos@ 


~ 1+ sin6 + sin@ cos 6 — cos? 6 Here we use the fact that 
ae 1 — cos? 6 = sin? 0. 


It follows that the value of A (and so jz) given by (6) gives us the (unique) 
point of intersection of £ and m’. 

There is one exceptional case, however, when the expression (9) for A makes 
no sense because its denominator is zero: that is, when 0 = 37/2, in which 
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case cos 0 = 0) and sin@ = —1, and the expression for A has denominator zero. 


Geometrically this corresponds to the situation in which m has been rotated 
around to the back of the surface E, when m’ is actually parallel to 2. 


We have thus proved a special case of the following general result. 


Theorem 2 A hyperboloid of one sheet contains two families of gener- 
ating lines. The members of each family are disjoint, and each member 
of either family intersects each member of the other — with exactly one 
exception. 


It is precisely the existence of these two families of generators that give 
power station cooling towers great intrinsic structural strength. 


The Hyperbolic Paraboloid 


It turns out that the result of Theorem 2 is also valid for hyperbolic paraboloids. 
In the following problem we ask you to verify this in a particular instance. 


Problem 4 Let E be the hyperbolic paraboloid with equation x* — 


y+z=0. 


(a) Verify that the lines A = {(x,y,z): x+y =0,z = O} and B = 
{(x, y,Z) : x — y = 0,z = 0} lie in the surface E. 

(b) Verify that the point (A — w,A + “, 4A) lies in E, for each value of 
d and jw. 

(c) For each value of 4, let €, denote the set 0, = {A -— w,A+ 
LL, 404): € MR} lying in the surface E. Prove that @, is a line 
which meets B at (A,A,0), and which passes through the point 
(A — 1,44 1,4A). Identify the line 0, when A = 0. 

(d) For each value of yw, let m, denote the set m, = {(A — w,A + 
Ht,4A) : A © R} lying in the surface E. Prove that m,, is a line 
which meets A at (—j2, 4,0), and which passes through the point 
(1 — p,1+ py, 4). Identify the line m,, when yz = 0. 

(e) Let Y denote the set of all lines €, and .@ the set of all lines 
m,. Prove that the members of each family are disjoint, and that 


each member of either family intersects each member of the other in 
exactly one point. 


A particularly astonishing result is the following. Let E be any surface in R* 
that has two families of generating lines with the following property: the mem- 
bers of each family are disjoint, and each member of either family intersects 
each member of the other (— with at most one exception). Then E is neces- 
sarily a quadric surface, and is in fact either a hyperboloid of one sheet or a We omit a proof. 
hyperbolic paraboloid. 
In the 1950s in the UK, a Lincoln artist and architect, Sam Scorer, saw 
the beauty of the hyperbolic paraboloid shape, and realized that the shape had 
intrinsic structural strength that could be utilized for the building of large roofs. 
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The bracing effect of the two families of generators meant that roofs in this 

shape could be built of very thin reinforced concrete; these require relatively 

few vertical supports, leaving wide unobstructed space underneath. Among 

the remaining examples of his work are a library, a church and a former garage The Little Chef at 

that is now a restaurant. The roof of the Brisbane Exhibition and Conference Markham Moor on the Al 
Centre in Australia also uses the same shape. road in England. 


1.5 Exercises 


Section 1.1 


1. Determine the equation of the circle with centre (2, 1) and radius 3. 

2. Determine the points of intersection of the line with equation y = x + 2 
and the circle in Exercise 1. 

3. Determine whether the circles with equations 


2x? + 2y?-—3x-—4y+2=0 and x*+y?-—4x4+2y=0 : 

at,, 2at 
intersect orthogonally. Find the equation of the line through their points of ae 
intersection. 

4. This question concerns the parabola y> = 4ax (a > 0) with parametric 
equations x = at?, y = 2at and focus F. Let P and Q be points on the 
parabola with parameters ft, and f2, respectively. 

(a) If PQ subtends a right angle at the vertex O of the parabola, prove that 
th - fo = —4. 
(b) If t; = 2 and PQ is perpendicular to OP, prove that = —4. 

5. This question concerns the rectangular hyperbola xy = c* (c > 0) with 
parametric equations x = ct, y = c/t. Let P and Q be points on the 
hyperbola with parameters ¢ and f2, respectively. 

(a) Determine the equation of the chord PQ. 
(b) Determine the coordinates of the point N where PQ meets the x-axis. 
(c) Determine the midpoint M of PQ. 
(d) Prove that OM = MN, where O is the origin. 
6. Let P be a point in the plane and C a circle with centre O and radius r. 


Then we define the power of P with respect to C as You will use the results of 
this exercise in Chapter 5. 


Olaty, 2aty) 


power of Pwith respect to C = OP =F. 


(a) Determine the sign of the power of P with respect to C when 
(i) P lies inside C; 

(ii) P lies on C; 

(iii) P lies outside C. 
In parts (b) and (c) we regard distances as directed distances; that is, dis- iccioh 
tances along a line in one direction have a positive sign associated with 
their length and distances in the opposite direction have a negative sign 
associated with their length. x 


Exercises 


(b) 


(c) 


(d) 


7. (a) 


(b) 


If P lies inside C and a line through P meets C at two distinct points 
A and B, prove that 


power of P with respect to C = PA - PB. 


If P lies outside C, a line through P meets C at two distinct points A 
and B, and PT is one of the tangents from P to C, prove that 


power of P with respect to C = PA- PB 
=P, 


If C has equation x* + y* + fx + gy +h = O and P has coordinates 
(x,y), find the power of P with respect to C in terms of x, y, f, g 
and h. 

Let a plane z in R? meet both portions of a right circular cone, in two 
separate portions of a curve EF. Let the two spheres inside the cone 
(on the same side of z as the vertex) that each touch both the cone in 
a horizontal circle (C; and C2, respectively) and z touch mz at F and 
F’, respectively. Let P be any point of E, and the generator of the 
cone through P meet C; and C2 at A and B, respectively. Prove that 
PF’ — PF = AB. Deduce that E is a hyperbola. 

Let a plane z in R? that is parallel to a generator of a right circular cone 
meet the cone in a curve E. Let the sphere inside the cone (on the same 
side of z as the vertex) that touches both the cone in a horizontal circle 
C and x meet z at F. Let P be any point of EF, and the generator of 
the cone through P meet C at A. Let N be the foot of the perpendicular 
from P to the line of intersection of the horizontal plane and zr, and 
let NA meet C again at M. Prove that PF = PN. Deduce that E is a 
parabola. 


Section 1.2 


1. Determine the slope of the tangent to the cycloid in R* with parametric 
equations 


x=t-—sint, y=1-—cost 


at the point with parameter ¢, where ¢ is not a multiple of 27. 
2. Determine the equation of the tangent to the curve in R? with parametric 
equations 


x=14+4t+??, y=1-t 


at the point where ¢ = 1. 


3. Let P be a point on the ellipse with equation eee = 1,wherea > b> 0, 
b> = a*(1—e*), and0 <e <1. 


(a) 


If P has coordinates (acost,bsint), determine the equation of the 
tangent at P to the ellipse. 
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We promised to tackle 
these two situations 
earlier, in 

Subsection 1.1.5. 


a be directrix 


(b) Determine the coordinates of the point T where the tangent in part (a) 
meets the directrix x = a/e. 

(c) Let F be the focus with coordinates (ae, 0). Prove that PF is perpendic- 
ular to TF. 

. The perpendicular from a point P on the hyperbola H with parametric equa- 

tions x = 2sect, y = 3 tant, to the x-axis meets the x-axis at the point N. 

The tangent at P to H meets the x-axis at the point 7: 

(a) Write down the coordinates of N. 

(b) Find the coordinates of 7: 

(c) Prove that ON - OT = 4, where O is the origin. 

. Let P be a point on the ellipse with equation - + y = 1,wherea > b> 0, 

b? =a7(1—e*), and0 <e <1. 

(a) If P has coordinates (acost,bsint), determine the equation of the 
normal at P to the ellipse. 

(b) Determine the coordinates of the point Q where the normal in part (a) 
meets the axis y = 0. 

(c) Let F be the focus with coordinates (ae, 0). Prove that OF = e - PF. 

. Let F denote the family of parabolas {(x, y) : y? = 4a(x + a)} as a takes 

all positive values, and Y denote the family of parabolas {(x,y) : y? = 

4a(—x + a)} as a takes all positive values. Use the reflection property of 
the parabola to prove that, if F ¢€ ¥ and G € GY, then, at each point of 
intersection, F and G cross at right angles. 

. Prove that a perpendicular from the focus nearer to a point P on an ellipse 

meets the tangent at P on the auxiliary circle of the ellipse, in the following 

geometric way. 

It is sufficient to prove the result for the ellipse E : = + es =la> 

b > 0, and points P of E in the first quadrant. Let T be the foot of the 

perpendicular from F(ae,0) to the tangent at P, let T’ be the foot of the 

perpendicular from F’(—ae,0) to the tangent at P, and let FT meet F’P 

at X. 

(a) Prove that the triangles AFPT and AXPT are congruent. 

(b) Using the sum of focal distances property for E, prove that F’/X = 2a. 

(c) Prove that OT is parallel to F’X, where O is the centre of E. 

(d) Prove that OT = a, so that T lies on the auxiliary circle of E. 

Remark: A similar argument to that in parts (a)-(d) shows that OT’ = a, 
so that T’ also lies on the auxiliary circle of FE. 

. (a) Let E be an ellipse with major axis AB and minor axis CD, and let the 
tangents to E at A and B meet the tangent at D at the points T and T’, 
respectively. Prove that the circle with diameter 77’ cuts the major axis 
of E at its foci. 

(b) Let H be a hyperbola with major axis AB, whose midpoint is O, and let 
the perpendicular at A to the major axis meet an asymptote at a point 
T. Prove that the circle with centre O and radius OT cuts the major axis 
of H at its foci. 
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normal 


This is part of Theorem 4 
of Subsection 1.2.3. A 
similar argument works 
for the other focus, but we 
do not look at that here. 


Xx 


T 


> 


F(-ae0) |9 — F(ae,0) J 4 


E 


auxiliary 
circle 


This gives a method of 
locating the foci of an 
ellipse, given its major 
and minor axes. 


This gives a method of 
locating the foci of a 
hyperbola, given its major 
and minor axes. 


Summary of Chapter 1 


Section 1.3 


1. 


Classify the conics in R? with the following equations. Determine the 
centre/vertex and axis of each. 

(a) x? — 4xy — 2y? + 6x4+ 12y+21=0 

(b) 5x? + 4xy + 5y? + 20x + 8y —-1=0 

(c) x? —4xy + 4y? — 6x —8y +5=0 

(d) 21x? — 24xy + 3ly? + 6x +4y —25=0 

(e) 3x? — 10xy + 3y* + 14x —2y +3=0 


. Determine the eccentricities of the conics in parts (a), (b) and (c) of 


Exercise 1. 


Section 1.4 


i 


Classify the quadrics in R? with the following equations. Determine the 
centre of each. 

(a) 4x? + 3y? — 277 + 4xy + 4yz4t 12x + 1227 +18 =0 

(b) xy-—ytyz= xz 

(c) 5x? + Sy? + 622 + 2/2yz + 2 2xz + 2xy + 23x — 4/6y —1=0 
(d) —3x? + Ty? + 72x + 126y +z+95=0 


. Determine the equations of the generators of the hyperboloid of one sheet 


E with equation 2x” — 3y* + 4z* = 3 through the point (1, 1, 1). 


Summary of Chapter 1 


Section 1.1: Conic Sections and Conics 


ile 


Conics (or conic sections) are the shapes that we obtain by taking different 
plane slices through a double cone. 

The non-degenerate conic sections are parabolas, ellipses and hyper- 
bolas; the degenerate conic sections are the single point, single line and 
pair of lines. 


. The ellipse and the hyperbola both have a centre; that is, there is a point C 


such that rotation about C through an angle z is a symmetry of the conic. 


. The equation of a circle in IR? with centre (a, b) and radius r is (x — a)* + 


(y— bP =r. 

An equation of the form x*+ y*+fr+gy-+h = O represents a circle with 
centre (-3 : —}8) and radius ,/ af? + 48° —h, provided that af? + 
qg°—h>0. 


. Two intersecting circles C, and C2 with equations x* + y+ fix+giy+ 


hy = Oand x7 + y? + fox + gry + hz = 0, respectively, are orthogonal 
if and only if fi fo + gig2 = 2(h1 +h2). 


. Let C; and C3 be circles with equations x*+ y*+ fixt+giy+h; = Oand 


x?+y?+ fox+goy+h2 = 0 that intersect at distinct points P and Q. Then 
the line and all circles (other than Cz) through P and Q have an equation 
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10. 


11. 


12: 


13. 


14. 


. A hyperbola in standard form has equation xs - 


of the form x7+ y?+ fixtgrythitk(x? + y?4+ fox + goy +h) =0 
for some number k. 

If k A —1, this equation is one of the circles; if k = —1, this is the 
equation of the line. 


. The non-degenerate conics can be defined as the set of points P in the 


plane that satisfy the following condition: The distance of P from a fixed 
point (the focus) is a constant multiple e (the eccentricity) of the distance 
of P from a fixed line (the directrix). 

A non-degenerate conic is an ellipse if 0 < e < 1, a parabola if e = 1, 
or a hyperbola if e > 1. 


. A parabola in standard form has equation y? = 4ax, where a > 0. 


It has focus (a, 0) and directrix x = —a; and it can be described by the 
parametric equations x = at?, y = 2at (t € R). 


2 2 
. An ellipse in standard form has equation 2 + oy = 1,wherea > b> 0, 


b? =a7(1—”),0<e <1. 
It can be described by the parametric equations x = acost, y = bsint 
(t € (—m,m)); or by x = aS, y =b 25 (ER). 
If e > O, it has foci (ae, 0) and directrices x = +a/e. 
y 


oa = 1, where 5? = 


a*(e* = 1), 2 = 1. 

It has foci (ae, 0) and directrices x = +a/e; and it can be described 
by the parametric equations x = asect, y = btant (t € (—/2,7/2) U 
(1 /2,3/2)); or by x = alt y= bt, (t € R— {+]}). 

A rectangular hyperbola has its asymptotes at right angles, and has 
eccentricity e = 2. In standard form it has equation xy = c?, c > 0. 

It can be described by the parametric equations x = ct, y = £ (t #0). 
The polar equation of a conic with focus O can be expressed in the form 
r=I1/(1+ecos@) (6 € R). 

Sum of Focal Distances of Ellipse Let E be an ellipse with major axis 
(—a, a) and foci F and F’. Then, if P is a point on the ellipse, FP-+PF’ = 
2a. In particular, FP + PF’ is constant for all points P on the ellipse. 

Given any two points F and F’ in the plane, the locus of points P in the 

plane for which PF + PF’ is a constant is an ellipse. 
Difference of Focal Distances of Hyperbola Let H be a hyperbola with 
major axis (—2a,2a) and foci F and F’. Then, if P is a point on the 
branch of the hyperbola that is closer to F, PF’ — PF = 2a; and, if P is 
a point on the branch of the hyperbola closer to F’, PF’ — PF = —2a. In 
particular, |PF’ — PF| is constant for all points P on the hyperbola. 

Given any three points F’, F’ and P in the plane for which PF’—PF # 0, 
the locus of points Q in the plane for which QF’ — QF = +|PF’ — PF| is 
a hyperbola. 

Dandelin spheres Let a plane z cut one portion of a right circular cone 
in acurve E. Let F and F’ be the points of contact with z of two spheres 
that touch that portion of the cone (in circles C and C’, respectively) 
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Summary of Chapter 1 


and zr. Then E is an ellipse, with foci F and F’, and with directrices the 
lines of intersection of z with the planes through C and C’, respectively). 


Section 1.2: Properties of Conics 


1. 


The slope (or gradient) of the tangent to a curve in R* with paramet- 
/ 
ric equations x = x(t), y = y(t) at the point with parameter ¢ is =e, 


provided that x’(t) 4 0. 


. The equation of the tangent at the point (x1, y;) to a conic in standard form 


is as follows. 


Conic Tangent 
2 2 
‘ x y XX] yyy 
Ellipse mtpa! ae wo! 
2 2 
x y XX] Bal 
Hyperbola wp! ae 
Parabola yy? = 4ax yy, = 2a(x + x1) 


. Let (a,b) be a point outside the unit circle, and let the tangents to the 


circle from (a,b) touch the circle at P; and P2. The polar of (a,b) with 
respect to the unit circle is the line through P; and P; this has equation 
ax+ by = 1. 


. The normal to a curve C at a point P of C is the line through P that is 


perpendicular to the tangent to C at P. 


. The Reflection Law The angle that incoming light makes with the tangent 


to a surface is the same as the angle that the reflected light makes with the 
tangent. 


. Reflection Property of the Ellipse Light which comes from one focus of 


an elliptical mirror is reflected at the ellipse to pass through the second 
focus. 


. Sine Formula In a triangle AABC with sides a, b, c opposite the vertices 


A, B, C, respectively, 


a _ b _ C 
sinZBAC sin ZABC sin ZACB’ 


. Reflection Property of the Hyperbola Light coming from one focus of a 


hyperbolic mirror is reflected at the hyperbola in such a way that the light 
appears to have come from the other focus. 

Also, light going towards one focus of a hyperbolic mirror is reflected 
at the mirror towards the other focus. 


. Reflection Property of the Parabola Incoming light parallel to the axis 


of a parabolic mirror is reflected at the parabola to pass through the focus. 

Conversely, light coming from the focus of a parabolic mirror is 
reflected at the parabola to give a beam of light parallel to the axis of 
the parabola. 
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10. 


11. 


12. 


Reflection Property of Conics All mirrors in the shape of a non- 
degenerate conic reflect light coming from or going to one focus towards 
the other focus. 

The auxiliary circle of an ellipse or hyperbola is the circle whose diameter 
is its major axis. The auxiliary circle of a parabola is the tangent to the 
parabola at its vertex. 

A perpendicular from a focus of a non-degenerate conic to a tangent meets 
the tangent on the auxiliary circle of the conic. 

This property can be used to construct a parabola, ellipse or hyperbola as 
the envelope of a family of lines that are tangents to the conics. 


Section 1.3: Recognising Conics 


1. 


Any conic has an equation of the form Ax” + Bry+ Cy* + Fx+Gy+H =0, 
where A, B, C, F, G and H are real numbers, and not all of A, B and C are 
zero. Conversely, any set of points in R* whose coordinates (x, y) satisfy 
this equation is a conic. 

This equation can be expressed in matrix form as 


A 1B 
x’ Ax+J’x+H =0, wi = ( ? ) 


. A2 x 2 matrix P represents a rotation of R? about the origin if and only if 


it satisfies the following two conditions: 
(a) P is orthogonal; 
(b) det P = 1. 


. Strategy To classify a conic E with equation 


Ax” + Bry + Cy’ + Fx+ Gy+H =0: 

1. Write the equation of E in matrix form x’ Ax + J’x + H =0. 

2. Determine an orthogonal matrix P, with determinant 1, that diagonal- 
izes A. 

3. Make the change of coordinate system x = Px’. The equation of E then 
becomes of the form A;x’? + Ary? +f’ + gy’ +h = 0, where A, and 
Az are the eigenvalues of A. 

4. ‘Complete the squares’, if necessary, to rewrite the equation of E in 
terms of an (x”, y”)-coordinate system as the equation of a conic in 
standard form. 

5. Use the equation x’ = P’x to determine the centre and axes of E in 
terms of the original coordinate system. 


. A non-degenerate conic with equation Ax? + Bxy + Cy? ++ Fx+Gy+H =0 


and matrix A can be classified as follows: 
(a) If detA < 0, E is a hyperbola. 

(b) If det A = 0, E is a parabola. 

(c) If detA > 0, E is an ellipse. 
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Section 1.3: Quadric surfaces 


1. A quadric (or quadric surface) in R° is a set given by an equation of the 
form Ax? + By” + Cz? + Fxy + Gyz+ Hxz+Jx-+Ky+Lz+M = 0, where 
A, B, C, F, G, H, J, K, Land M are real numbers, and not all of A, B, C, F, 
G and H are zero. 
This equation can be expressed in matrix form as 


A 3F 4H 

x’ Ax + J’x+M =0, where A = ar B 5G , 
5H 5G C 
J x 


J=|K and x=] y 
6 Zz 


iw) 


. The degenerate quadrics are the empty set, a single point, a single line, 
a single plane, a pair of planes and a cylinder. A cylinder is a surface that 
consists of an ellipse, parabola or hyperbola in some plane zr, together with 
all the lines in R? through that conic that are normal to z. 

3. There are six non-degenerate quadrics: 


Quadric Typical equation 
x2 yy? 22 
Ellipsoid ” + p + a 1 


Hyperboloid of one sheet = + 2 cs =1 


ge yt 
Hyperboloid of two sheets ” + p a 1 
2 2 
x 
Elliptic cone C= 3 ie 3 
vy? 
Elliptic paraboloid Z= 2 + np 
2 2 


Hyperbolic paraboloid Z= 


4. A3 x 3 matrix P represents a rotation of R? about the origin if and only if 
it satisfies the following two conditions: 
(a) P is orthogonal; 
(b) det P= 1. 

5. Strategy To classify a quadric E with equation Ax” + By” + Cz? + Fxy + 
Gyz + Hxz+ Jx+ Ky+L1z+M=0: 
1. Write the equation of E in matrix form x’ Ax + J’x + M = 0. 
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2. Determine an orthogonal matrix P, with determinant 1, that diagonal- 
izes A. 

3. Make the change of coordinate system x = Px’. The equation of E then 
becomes of the form A,x’? + Azy? +4327 + jx + hy +1 +m =0, 
where 41, A2 and A3 are the eigenvalues of A. 

4. ‘Complete the squares’, if necessary, to rewrite the equation of E in 
terms of an (x”, y”, z’)-coordinate system as the equation of a quadric 
in standard form. 

5. Use the equation x’ = P’x to determine the centre and planes of 
symmetry of E in terms of the original coordinate system. 

A ruled surface in R? is a surface that can be made up from a family of 

straight lines, called its generating lines or generators. 


. A hyperboloid of one sheet and a hyperbolic paraboloid contain two fami- 


lies of generating lines. The members of each family are disjoint, and each 
member of either family intersects each member of the other — with exactly 
one exception. 

These are the only two quadrics that are ruled surfaces. 
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2 Affine Geometry 


In Chapter | we studied conics in Euclidean geometry. In the rest of the book 
we prove a whole range of results about figures such as lines and conics, in 
geometries other than Euclidean geometry. In the process of doing this, we 
meet two particular features of our approach to geometry which may be new 
to you. 

The first feature is the use of transformations in geometry to simplify prob- 
lems and bring out their essential character. You may have met some of these 
transformations previously in courses on Group Theory or on Linear Algebra. 

The second feature arises from the fact that the transformations we introduce 
form groups. Generally, we restrict our attention to geometry in the plane, 
IR, but even in this familiar setting there may be more than one group of 
transformations at our disposal. This leads to the exciting new idea that there 
are many different geometries! 

Each geometry consists of a space, some properties possessed by figures 
in that space, and a group of transformations of the space that preserve these 
properties. For example, Euclidean plane geometry uses the space R?, and 
is concerned with those properties of figures that depend on the notion of dis- 
tance. The group associated with Euclidean geometry is the group of isometries 
of the plane. 

This idea, that geometry can be thought of in terms of a space and a 
group acting on it, is called the Kleinian view of geometry, after the 19th- 
century German mathematician Felix Klein who proposed it first. It has the 
virtue of enabling us to generate many geometries, while seeing how they are 
related. 

For instance, we can take IR? as our space and use the group of all transfor- 
mations of the form t(x) = Ax + a, where a € R? and A is a2 x 2 invertible 
matrix. These are the so-called affine transformations of R?. But what prop- 
erties of figures in R* are preserved by such transformations, and what is the 
corresponding geometry? The geometry is called affine geometry, and it is the 
subject of this chapter. As you will see, it has some features in common with 
Euclidean geometry, but also some very different features. 

In Section 2.1 we examine Euclidean geometry from the Kleinian point of 
view, and explain why geometries other than Euclidean geometry exist. 


In this book, we shall 
often use matrices to 
simplify our work. 


This is because isometries 


of the plane preserve 
distances. 
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In Sections 2.2 and 2.3 we introduce affine geometry and consider its prop- 
erties. In particular, we show that affine transformations map straight lines to 
straight lines, map parallel lines to parallel lines, and preserve ratios of lengths 
along a given line. We also discover that in affine geometry all triangles are 
congruent, in the sense that any triangle can be mapped onto any other triangle 
by an affine transformation. This result is known as the Fundamental Theorem 
of Affine Geometry. 

In Section 2.4 we establish two important theorems, due to Ceva and 
Menelaus, which involve ratios of lengths along the sides of a triangle. 

Finally, in Section 2.5 we investigate the effect of affine transformations on 
conics, and discover that we can use the methods of affine geometry to obtain 
very simple proofs of certain types of theorems about conics. 


2.1. Geometry and Transformations 


Before embarking on a study of various other geometries, it is useful first to 
look back at our familiar Euclidean geometry. 


2.1.1. What is Euclidean Geometry? 


To help us answer this question, we begin by considering the following well- 
known result. 


Example 1 Let AABC be a triangle in which ZABC = ZACB. Prove that 
AB = AC. 


Solution First, reflect the triangle in the perpendicular bisector of BC, so 

that the points B and C change places and the point A moves to some point 

A’, say. Since reflection preserves angles, it follows that ZA’BC = ZACB. 
Also, we are given that ZACB = ZABC, so 


ZA'BC = ZABC. 
But this can happen only if A’ lies on the line through A and B. Similarly, 
ZA'CB = ZABC = ZACB, 


so A’ must also lie on the line through A and C. This means that A’ and A 
must coincide. Hence the line segment AB reflects to the line segment AC, and 
vice versa. Since reflection preserves lengths, it follows that AB = AC. 


Problem 1 Let A and B be two points on a circle, and let the tangents 

to the circle at A and B meet at P. Prove that AP = BP. 

Hint: Consider a reflection in the line which passes through P and the 
centre of the circle. 
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Geometry and Transformations 


The result in Example | is concerned with the properties of length and angle 
associated with the triangle AABC. To investigate these properties, we intro- 
duced a reflection that enabled us to compare various lengths and angles. We 
were able to do this because reflections leave lengths and angles unchanged. 

Of course, reflections are not the only transformations that preserve lengths 
and angles: other examples include rotations and translations. In general, 
any transformation that preserves lengths and angles can be used to tackle 
problems which involve these properties. In fact, we need worry only about 
leaving distances unchanged, since any transformation from R* onto R? that 
changes angles must also change lengths. Transformations that leave distances 
unchanged are called isometries. 


Definition An isometry of R? is a function which maps R* onto R? and 
preserves distances. 


In fact, every isometry has one of the following forms: 


a translation along a line in R?; 
a reflection in a line in R?: 
a rotation about a point in R?; 


a composite of translations, reflections and rotations in R?. 


The composite of any two isometries is also an isometry, and so it is easy to 
verify that the set S(R7) of isometries of R* forms a group under composition 
of functions. These observations can be used to build up the transformations 
we need in order to prove Euclidean results. 


Example 2 Prove that if AABC and ADEF are two triangles such that 
AB=DE, AC=DF and ZBAC= ZEDF, 


then BC = EF, ZABC = ZDEF and ZACB = ZDFE. 


Solution It is sufficient to show that there is an isometry which maps AABC 
onto ADEF. We construct this isometry in stages, starting with the translation 
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This is because, once we 
know the lengths of the 
sides of a triangle, the 
angles are uniquely 
determined. 


The identity isometry can 
be regarded as a rotation 
through an angle that is a 
multiple of 277. 
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which maps A to D. This translation maps AABC onto ADB’C’, where B’ and 
C’ are the images of B and C under the translation. 


E 


translate - 


A Ds 
an rotate 
x ie Ns . 
‘ /. ™ Cc Pies eeOsvcasesces ae ls 


Since we are given that DF = AC, and since the translation maps AC onto 

DC’, it follows that DF = DC’. We can therefore rotate the point C’, about D, 

until it coincides with the point F. This rotation maps ADB’C’ onto ADB" F, 

as shown in the margin, where B” is the image of B’ under the rotation. 
Finally, notice that 


ZFDE = ZCAB (given) 
= ZC'DB’ (translation) 
= ZFDB" (rotation), 


so either B” lies on DE or the reflection of B” in the line FD lies on DE. Also 
DE=AB (given) 
= DB’ (translation) 
= DB" (rotation). 
It follows that either B” coincides with E or the reflection of B” in the line 
FD coincides with E. 
So, composing the translation, the rotation, and (if necessary) a reflection, 
we obtain the required isometry that maps AABC onto ADEF. Since isome- 


tries preserve length and angle, it follows that BC = EF, ZABC = ZDEF and 
ZACB = ZDFE. 


Problem 2 Prove that if AABC and ADEF are two triangles such that 
AC=DF, ZBAC=ZEDF and ZACB= ZDFE, 
then BC = EF, AB = DE and ZABC = ZDEF. 


We can now answer the question ‘What is Euclidean geometry?’. Euclidean 
geometry is the study of those properties of figures that are unchanged by the 
group of isometries. We call these properties Euclidean properties. Roughly 
speaking, a Euclidean property is one that is preserved by a rigid figure as 
it moves around the plane. Of course, these properties include distance and 
angle, but they also include other properties such as collinearity of points and 
concurrence of lines. 
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Of course, A and D may 
already coincide, in which 
case we omit the 
translation stage. 


If C’ already coincides 
with F’, then we omit the 
rotation stage. 


E 
F reflect 
D; Pa 


We consider only 
Euclidean geometry in the 
plane R2. 


Geometry and Transformations 


This idea, that geometry can be thought of in terms of a group of transforma- 
tions acting on a space, is known as the Kleinian view of geometry. It enables us 
to generate many geometries, without losing sight of the relationship between 
them. 

When we consider geometries in this way, it is often convenient to have 
an algebraic representation for the transformations involved. This not only 
enables us to solve problems in the geometry algebraically, but also provides 
us with formulas that can be used to compare different geometries. 

In the case of Euclidean geometry, perhaps the easiest way to repre- 
sent isometries algebraically is to use matrices. For example, the function 


defined by 
x cos@ —siné x e 2 
t: ; syyER ) 1 
()> (ine “eno )G)+(G) (ener) © 
is an isometry because it is the composite of an anticlockwise rotation through 
an angle 6 about the origin, followed by a translation through the vector (e, f). 


Similarly, the function 
x cos 6 sin @ x e 2 
a ; x, R ) 2 
@Gaer oe) (3) +(4) (@ ye (2) 
is an isometry because it is the composite of a reflection in the line through 
the origin that makes an angle 6/2 with the x-axis, followed by a translation 
through the vector (e, f). 
Remarkably, we can represent any isometry by one or other of the forms 
given in (1) and (2). To see this, notice that any isometry ¢ can be written in the 
form 


t(x) = 1o(x) + (e, f) (x é R’) (3) 


where fg is an isometry which fixes the origin. Indeed, if we let (e, f) = ¢(0), 
then we can let fp be the transformation defined by fo(x) = f(x) — (e, f). This 
is an isometry because it is the composite of the isometry ¢ and the translation 
through the vector —(e, f). It fixes the origin since t9(0) = t(0) — (e, f) = 0. 

Now an isometry that fixes the origin must be either a rotation about the 
origin, or a reflection in a line through the origin. If fo is a rotation about the 
origin, then (3) can be written in the matrix form given in (1), whereas if fo is 
a reflection in a line through the origin, then (3) can be written in the matrix 
form given in (2). 

So together, equations (1) and (2) provide us with an algebraic representa- 
tion of all possible isometries of the plane. The next problem indicates how we 
can obtain a more concise description of this algebraic representation by using 
orthogonal matrices to combine equations (1) and (2). 


Problem 3 Show that both the matrices 
cos@ —sind aad cos @ sind 
sin @ cos @ sind —cos@)’ 


which appear in (1) and (2), are orthogonal for each real number 0. 
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Here a space is simply a 
collection of points; for 
example, the plane is 
R? = {(x,y): 

x,y are real}. 


Recall that the effect of 
the matrix multiplication 
in (1) and (2) can be 
interpreted geometrically 
by examining what 
happens to the vectors (1, 
0) and (0, 1). For example, 
in (2) the matrix 
multiplication sends (1, 0) 
and (0, 1) to (cos 6, sin@) 
and (sind, — cos @), 
respectively, so it 
corresponds to the 
reflection shown in the 
figure below. 


ya 
(0,1) @ 


(cos 6, sing) 


(sin @, —cos @) 


Recall that a matrix U is 
orthogonal if u-l=uv’, 
that is, if UU = L. This 
is equivalent to saying that 
the columns of U are 
orthonormal. 
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By applying the solution of Problem 3 to equations (1) and (2), we see that 
every isometry ¢ has an algebraic representation of the form 


t(x) = Ux+a, 


where U is an orthogonal 2 x 2 matrix, and a is a vector in R?. 


Definition A Euclidean transformation of R? is a function ¢ : R* > R? 
of the form 

t(x) = Ux+a, 
where U is an orthogonal 2 x 2 matrix and a € R?. The set of all Euclidean 
transformations of R? is denoted by E (2). 


We may summarize the discussion above by saying that every isometry of 
the plane is a Euclidean transformation of R?. 

In fact, the converse is also true, for if U is any orthogonal matrix, then 
its columns are orthonormal. In particular, its first and second columns have 


0 
unit length and can therefore be written in the form ee ) and a ‘) 
sin @ sing 


respectively, for some real 6, @. For these to be orthonormal, we must have 
cos 6 -cos @+sin 6 - sind = 0, so that tan@-tang = —l and hence d =0+%. 
It follows that the second column must be 


cos(6+7/2)\ _ ( —siné a 
sin(6 + 1/2) } cos 0 
cos(@6 — 7/2)\ _ sin@ 
sin(@ — 2/2) ) \ —cosé )° 
cos@ —sin@ cos 8 sin @ 
v= (2 ) a v= (% Beak 
It follows that every Euclidean transformation t(x) = Ux + a of IR? has one 


of the forms given in equations (1) and (2). Since both of these forms represent 
isometries of the plane,we have the following theorem. 


So 


Theorem 1 Every isometry of IR? is a Euclidean transformation of R?, 
and vice versa. 


Now the set of all isometries of R* forms a group under composition of 
functions, so it follows from Theorem | that the same must be true of the set of 
all Euclidean transformations of R?. We therefore have the following theorem. 


Theorem 2 The set of Euclidean transformations of R? forms a group 
under the operation of composition of functions. 
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This equation shows the 
matrix U acting on the 
vector x = (x, y). Strictly 
speaking, U acts on the 


coordinates (G ) of x 


with respect to the 
standard basis of the 
vector space R? as in 
equations (1) and (2). 
However, since the 
numbers x and y are the 
same for the vector and its 
standard coordinates, no 
confusion should arise. 


It is not difficult to prove 
that this is a group, so we 
omit the proof. 


Geometry and Transformations 


It is instructive to check the group axioms algebraically, for in the process 
of doing so we obtain formulas for the composites and inverses of Euclidean 


transformations. 
We start by considering closure. Suppose that t, and fg are two Euclidean 


transformations given by 
t(x) = Ujx+a; and (x) = Upx+ ap, 


where Uj and U2 are orthogonal 2 x 2 matrices. Then the composite ft 0 f2 is 


given by 


ty 0 f2(x) = t)(U2x + az) 
= U;(U2x + a) + ay 
= U,U2x + (Ujaz + a)). 


This is a Euclidean transformation since U; U2 is orthogonal. Indeed, 
(UjU2)? = US UT = U;'U;! = WyU2)!. 


So the set of Euclidean transformations is closed under composition of 


functions. 


Problem 4 Let the Euclidean transformations t; and ft of R? be 


given by 
t(x) = ( 


h(x) = (" 


Determine f 0 f2 and f7 of}. 


We Uw 


and 


Uw Ul 

Ue Ulw 

——_——* 
ws 
+ 

fo 
| 

re N 

ca 


Next recall that under composition of functions the identity is the transfor- 
mation given by i(x) = x. This is a Euclidean transformation since it can be 


written in the form 
i(x) =Ix+0, 


where I is the 2 x 2 identity matrix, which is orthogonal. 
The next problem asks you to show that inverses exist. 


Problem 5 Prove that if t; is a Euclidean transformation of R? 
given by 


t(x)=Ux+a  (xeR?’), 


Here we are using the 
result that 
(AB)? = B7 A’. 
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then: 


(a) the transformation of R? given by 
to(x) =U'x—U'a (xe R’) 


is also a Euclidean transformation; 
(b) the transformation fz 1s the inverse of f1. 


The solution of Problem 5 shows that we can calculate the inverse of a 
Euclidean transformation by using the following result. 


The inverse of the Euclidean transformation f(x) = Ux + a is given by 


t'(x) = U-'x-— Ua. 


Problem 6 Determine the inverse of the Euclidean transformation 


given by 
3 _4 1 
5 5 
wo=({ y+ (2): 
5 5 


Finally, composition of functions is always associative. So all four group 
properties hold, as we expected. 

Earlier, we described Euclidean geometry as the study of those properties 
of figures that are preserved by isometries. Having identified these isome- 
tries with the group of Euclidean transformations, we can now give the 
equivalent algebraic description of Euclidean geometry. Euclidean geome- 
try is the study of those properties of figures that are preserved by Euclidean 
transformations of R?. 


2.1.2 Euclidean-Congruence 


In the solution to Example 2 we showed that if two triangles AABC and ADEF 
are such that AB = DE, AC = DF and ZBAC = ZEDF, then there is a 
Euclidean transformation which maps AABC onto ADEF. 


/~ 
B C 


The existence of this transformation enabled us to deduce that both triangles 
have the same Euclidean properties. In particular, we were able to deduce that 
BC = EF, ZABC = ZDEF and ZACB = ZDFE. 
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Geometry and Transformations 


In order to formalize this way of relating two figures, we say that two figures 
are congruent if one can be moved to fill exactly the position of the other 
by means of a Euclidean transformation. Loosely speaking, two figures are 
congruent if they have the same size and shape. 

Later we consider congruence with respect to other groups of transforma- 
tions (that is, congruence in other geometries), so if there is any danger of 
confusion we sometimes say that two figures are Euclidean-congruent. 


Definition A figure F; is Euclidean-congruent to a figure F> if there is a 
Euclidean transformation which maps F) onto Fp. 


For example, any two circles of unit radius are Euclidean-congruent to each 
other because we can map one of the circles onto the other by means of a 
translation that makes their centres coincide. 


Problem 7 Which of the following sets consist of figures that are 
Euclidean-congruent to each other? 


(a) The set of all ellipses 

(b) The set of all line segments of length 1 

(c) The set of all triangles 

(d) The set of all squares that have sides of length 2 


Earlier, we emphasized that the Euclidean transformations form a group. 
This is important because it ensures that Euclidean-congruence has the kind of 
properties that we should expect. For example, we should expect every figure 
to be congruent to itself. Also, if a figure F; is congruent to a figure F, then we 
should expect F> to be congruent to F;. We can, in fact, establish the following 
result. 


Theorem 3 Euclidean-congruence is an equivalence relation. 


Proof We show that the three equivalence relation axioms El, E2 and E3 
hold. 


El REFLEXIVE For all figures F in R’, the identity transformation maps F 
onto itself; so Euclidean-congruence is reflexive. 

E2 SYMMETRIC Let a figure F; in R* be congruent to a figure F>, and let f 
be a Euclidean transformation which maps F onto Fy. Then 
the inverse Euclidean transformation t—! maps F2 onto F), 
so that F2 is congruent to F;. Thus Euclidean-congruence is 
symmetric. 

E3 TRANSITIVE Let a figure F; in R* be congruent to a figure Fy, and let 
Fy be congruent to a figure F3. Then there exist Euclidean 
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Of course, there are many 
other Euclidean 
transformations which 
map one of the circles 
onto the other. 


This uses the existence of 
an identity transformation. 


This uses the existence of 
inverse transformations. 
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transformations t; mapping F; onto F>2 and ft mapping Fy 
onto F3. Thus the Euclidean transformation f2 0 ft; maps 
F, onto F3, so that F; is congruent to F3. Hence Euclidean- 
congruence is transitive. 


It follows that Euclidean-congruence is an equivalence relation, because it 
satisfies the axioms El, E2 and E3. |_| 


Problem 8 Prove that if two figures in R? are each Euclidean- 
congruent to a third figure, then they are Euclidean-congruent to each 
other. 


Since Euclidean-congruence is an equivalence relation, it partitions the 
set of all figures into disjoint equivalence classes. Each class consists of 
figures which are Euclidean-congruent to each other, and hence share the same 
Euclidean properties (for example, one class consists of all circles of unit 
radius, another class consists of all equilateral triangles with sides of length 
3, and so on). If we wish to show that two figures have the same Euclidean 
properties, then it is sufficient to show that they are Euclidean-congruent. 

Now Euclidean geometry is just one of several different geometries. Each 
geometry is defined by a group G of transformations that act on a space. 
In general, we say that two figures are G-congruent if there is a trans- 
formation in G which maps one of the figures onto the other. Since the 
only properties used in the proof of Theorem 3 are the group properties of 
Euclidean transformations, the theorem holds also with ‘G-congruent’ in place 
of ‘Euclidean-congruent’. Thus, like Euclidean-congruence, G-congruence 
is an equivalence relation that partitions the set of all figures into disjoint 
equivalence classes. 

This idea of partitioning figures into equivalence classes is central to geom- 
etry. It enables us to distinguish between figures in different equivalence 
classes, without having to worry about the differences between figures in the 
same equivalence class. For example, if we are interested in whether a conic 
is an ellipse rather than a hyperbola or a parabola, but do not care about 
its shape (that is, the ratio of the lengths of its axes), we might choose to 
work with some geometry whose group of transformations makes all ellipses 
congruent to each other — but not congruent to any hyperbola or parabola. 
We describe a group of transformations which defines such a geometry in 
Section 2.2. 


2.2. Affine Transformations and Parallel Projections 


2.2.1 


In Section 2.1 you met a new approach to Euclidean geometry in R? — namely, 
the idea that Euclidean geometry of R? can be interpreted as a space, R?, 


Affine Transformations 
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This uses the closure 
axiom for the group of 
Euclidean 
transformations. 


For example, to show that 
two triangles AABC and 
ADEF have the same 
Euclidean properties, it is 
sufficient to show that 
AB = DE, AC = DF and 
ZBAC = ZEDF, as you 
saw in Example 2. This 
congruence condition is 
frequently used in 
Euclidean geometry. It is 
known as the ‘side angle 
side’ (SAS) condition for 
congruence. 
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together with the group of Euclidean transformations which act on that space. 
Recall that a Euclidean transformation is a function t : R* > R? of the form 


t(x) =Ux+a (xeR?), 


where U is an orthogonal 2 x 2 matrix. Euclidean properties of figures are 
those, like distance and angle, that are preserved by these transformations. 

In this section we meet the first of our new geometries in R? — affine 
geometry. This geometry consists of the space R? together with a group of 
transformations, the affine transformations, acting on R?. 


Definition An affine transformation of IR? is a function t : R* > R? of 
the form 

t(x) = Ax+b, 
where A is an invertible 2 x 2 matrix and b € R2. The set of all affine 
transformations of R? is denoted by A(2). 


Remark 


Note that every Euclidean transformation of R* is an affine transformation 
of IR? since every orthogonal matrix is invertible. (In terms of groups, the 
group of Euclidean transformations of R? is a proper subgroup of the group 
of affine transformations of R?.) This means that all properties of figures that 
are preserved by affine transformations must be preserved also by Euclidean 
transformations. 


Problem 1 Determine whether or not each of the following transfor- 
mations of R? is an affine transformation. 


(a) noo = (| >)x+(5) (6) 1200 = (~$ >)x+(%) 
©@ no=(Z ~4)x+(4) @ue=(_5 )a 


The algebra required to compose affine transformations is similar to the 
algebra that we used to compose Euclidean transformations. 


Problem 2 For the transformations of R* given in Problem 1, deter- 
mine formulas for the following composites. In each case, state whether 
or not the composite is an affine transformation. 


(a) trot (b) f2 0 t4 


We now verify our assertion above that the set of affine transformations 
forms a group. 
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Affine geometry can be 
defined in R”, for any 

n > 2; we restrict our 
attention here to the case 
when n = 2. 
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Theorem 1 The set of affine transformations A(2) forms a group under 
the operation of composition of functions. 


Proof We check that the four group axioms hold. 
Gl CLOSURE Let f; and f2 be affine transformations given by 
t(x)=A,x+b; and f(x) = Aox+bo, 


where A, and A» are invertible 2 x 2 matrices. Then, for 
each x € R?, 


(t 0 f2)(x) = t) (Aox + bz) 
= Aj(Aox+ bo) + by 
= (A; A2)x + (Ajb2 + by). 


Since A; and Ag are invertible, it follows that A;Ag 
is also invertible. So by definition ft; 0 f is an affine 
transformation. 

G2 IDENTITY Let i be the affine transformation given by 


i(x) =Ix+0 (xe R?’), 


where I is the 2 x 2 identity matrix. If ¢ is an affine 
transformation given by 


t(x) =Ax+b (xe R?’), 
then, for each x € R?, 


(t oi)(x) = Adx + 0) + b= Ax+ b= f(x) 


and 
(i ot)(x) = I(Ax+b) +0= Ax+b= r(x). 
Thus foi = iot = t. Hence i is the identity 
transformation. 
G3 INVERSES If tis an arbitrary affine transformation given by 


t(x) =Ax+b (xe R?’), 
then we can define another affine transformation ¢’ by 
t'(x) = A7!x — A7'b. 
Now for each x € R?, we have 
(to t’)(x) = t(A~!x — A7!b) 
= A(A7'x— A7'b) +b 
= (AA~!x — AA~'b) +b 
= (x—b)+b 


= X. 
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Also, 
(t' 0 t)(x) = t'(Ax+b) 
= A7'(Ax+b)—A7!b 
= (A7'Ax+A7'b)— A7'b 
= (x+A'b)—A7'b 
= xX. 


Thus to t/ = t’ ot =i. Hence #’ is an inverse for t. 
G4 ASSOCIATIVITY Composition of functions is always associative. 


It follows that the set of affine transformations A(2) forms a group under 
composition of functions. | 


The above proof shows that we can calculate the inverse of an affine 
transformation by using the following result. 


The inverse of the affine transformation t(x) = Ax + b is given by 


t~!(x) = AT!x— A7'b. 


Problem 3 Find the inverse of the affine transformation 
1 3 4 
nix) = (| 3)x+(_3)- 


Having shown that the set of affine transformations forms a group under 
composition of functions, we now define affine geometry to be the study of 
those properties of figures in the plane R? that are preserved by affine trans- 
formations. These are the so-called affine properties of figures. We begin our 
investigation of affine geometry by considering the three affine properties listed 
below. 


Basic Properties of Affine Transformations 
Affine transformations: 


1. map straight lines to straight lines; 
2. map parallel straight lines to parallel straight lines; 
3. preserve ratios of lengths along a given straight line. 


There are two approaches that we shall use to investigate these properties. 
One approach is to use the definition of an affine transformation to investi- 
gate the properties algebraically; we do this in Section 2.3. First, however, 
we investigate the properties geometrically. We begin to do this in the next 
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subsection by introducing a special type of affine transformation for which 
there is a simple geometric interpretation. 


2.2.2 Parallel Projections 


A parallel projection is a one-one mapping from R? onto itself, defined in the 
following way. First, we think of its domain and codomain as two separate 


copies of R?. 


domain codomain 


Geometrically, we can represent these copies of R? by two separate planes, 
each equipped with a pair of rectangular axes. 


domain copy of IR” codomain copy of R? 


Next we place these planes into three-dimensional space; we denote the 
domain plane by zr; and the codomain plane by 72. 
Now imagine parallel rays of light shining through zr and zr. Each point P 
in the plane zr; has a (unique) ray passing through it, that also passes through a 
point P’, say, in the plane zr. This provides us with a one-one correspondence 
between points in the two planes 7; and 72. We call the function p which maps Of course, since 7) and 


each point P in 7 to the corresponding point P’ in z2 a parallel projection 72 represent copies of R’, 


from z onto >. a parallel projection is 
really a function from R2 


onto itself. In 


TY m2 
Pp 
rN Subsection 2.2.3 we show 
that parallel projections 
are affine transformations. 


If the roles of the planes 2; and zr are reversed, so that 22 becomes the 
domain plane and z; becomes the codomain plane, then we obtain the inverse 
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function p~! which maps points P’ in m2 back to the corresponding points P 


in 7. Clearly, p~! is a parallel projection of zz onto 7}. 

Each choice of location for the domain plane 7, and the codomain plane z2, 
and each choice of direction for the rays of light, yields a parallel projection. 
The only constraint is that the rays of light must not be parallel to either plane. 

If the planes zr; and zz are parallel to each other, then any parallel projection 
p from zr; onto z2 is an isometry, since the distance between any two points is 
unaltered. 


P 


You can envisage the 
mapping p from zr; onto 
7 as ‘sliding zr parallel 
to itself along the family 
of rays’. 


On the other hand, if the planes are not parallel to each other, then some 
distances are changed under the projection, and so the parallel projection is 
not an isometry; notice, however, that distances along the line of intersection 
of the planes zr; and zr2 do remain unchanged by the parallel projection. 


BP ae 


eS, 


Although distances are not always preserved by a parallel projection, there 
are some basic properties that are preserved; three of these are listed below. As 
you will see, these are the same as the basic affine properties that we mentioned 
at the end of Subsection 2.2.1. 


Basic Properties of Parallel Projections 
Parallel projections: 


1. map straight lines to straight lines; 
2. map parallel straight lines to parallel straight lines; 
3. preserve ratios of lengths along a given straight line. 
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Later, we will show that each basic affine property follows directly from the 
corresponding property for parallel projections. In anticipation of this, we first 
show that the properties hold for parallel projections. 


Property 1 A parallel projection maps straight lines to straight lines. 


Proof Let ¢ be a line in the plane 7, and let p be a parallel projection map- 
ping 7 onto the plane 72. Now consider all the rays associated with p that 
pass through €. Since these rays are parallel, they must fill a plane. Call this 
plane z. 


The image of ¢ under p consists of those points where the rays that pass 
through ¢ meet zr2. But these points are simply the points of intersection of z 
with 72. Since any two intersecting planes in R? meet in a line, it follows that 
the image of @ under p is a straight line. a 


Property 2 A parallel projection maps parallel straight lines to parallel 
straight lines. 


_——— as a 


Ly Ly 


Mn, My 


Proof Let ¢; and m, be parallel lines in the plane zy), and let p be a parallel 
projection mapping zr; onto the plane 2. Let 2 and mz be the lines in 2 that 
are the images under p of ¢; and my. 

If 22 and mz are not parallel, they meet at some point, P2 say. Let P; be 
the point of z; which maps to P2. Then P; must lie on both 2; and m,. Since 
£, and m, are parallel, no such point of intersection can exist, which is a 
contradiction. It follows that 22 and m2 must indeed be parallel. | 
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P, is the point p—!(P). 
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Property 3 A parallel projection preserves ratios of lengths along a given 
straight line. 


™ 


Proof Let A, B, C be three points on a line in the plane zy), and let p be a 
parallel projection mapping zr; onto the plane 2. Let P, Q, R be the points in 
grz that are the images under p of A, B, C. We know from Property 1 that P, 
Q, R lie on a line; we have to show that the ratio AB : AC is equal to the ratio 
PQ: PR. 

If the planes 2; and zp are parallel, then the parallel projection p is an 
isometry, and so the ratios AB : AC and PQ : PR are equal, as required. On 
the other hand, if 7; and z2 are not parallel, then we can construct a plane z 
through the point P which is parallel to 21, as shown in the margin. This plane 
intersects the ray through B and Q at some point B’, and the ray through C and 
R at some point C’. So in this case the ratios AB : AC and PB’ : PC’ are equal. 

Now consider APC’R. The lines B’Q and C’R are parallel, since they are 
rays from the parallel projection. Hence B’Q meets the sides PR and PC’ in 
equal ratios. Thus PQ : PR = PB’ : PC’. It follows that PO : PR = AB : AC, 
as required. (i 


Notice, in particular, that if a point is the midpoint of a line segment, then 
under a parallel projection the image of the point is the midpoint of the image 
of the line segment. 

In Subsection 2.2.3 you will see why the basic properties of affine trans- 
formations and of parallel projections are the same, and you will meet some 
further properties of each. 


2.2.3 Affine Geometry 


In this subsection we explore further the ideas of affine geometry and of par- 
allel projection in order to prove two attractive and unexpected results about 
ellipses. Also, we examine the relationship between affine transformations and 
parallel projections. 


Two Results about Ellipses 


First, starting with any chord ¢ of an ellipse, draw all the chords paral- 
lel to € and construct their midpoints. We claim that these midpoints lie 
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We shall give a slightly 
more general form of this 
property in Theorem 4 of 
Subsection 2.3.3. 


We make use of this fact 
in Subsection 2.2.3. 
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on a chord through the centre of the ellipse — that is, on a diameter of 
the ellipse. 


Theorem 2 Midpoint Theorem 
Let ¢ be a chord of an ellipse. Then the midpoints of the chords parallel to 
£ lie on a diameter of the ellipse. 


diameter 


Next, start with any diameter ¢ of an ellipse and construct a second diameter 
m by following the construction used in Theorem 2, as shown below. Then 
repeat the construction starting this time with the diameter m; this might rea- 
sonably be expected to give us a third diameter of the ellipse — but, surprisingly, 
it gives us the diameter £ with which we started. 


£ £ 
CD Ss 


Theorem 3 Conjugate Diameters Theorem 
Let £ be a diameter of an ellipse. Then there is another diameter m of the The directions of these 


ellipse such that: two diameters are called 

conjugate directions, and 
(a) the midpoints of all chords parallel to @ lie on m; the diameters are called 
(b) the midpoints of all chords parallel to m lie on @. conjugate diameters. 


Proofs for the Special Case of a Circle 


We now investigate these theorems for the special case when the ellipse is a 

circle. To prove the Midpoint Theorem in this case, start with a chord €. If Recall that a circle is an 
necessary, rotate the circle to ensure that ¢ is horizontal. It is then sufficient to ellipse with eccentricity 
prove that every horizontal chord is bisected by the vertical diameter, m. 2eI0; 


a—._¥ 
a 
f= 
a 
SEs 
os 
—__” 


To do this note that the circle is symmetrical about m; so, reflection in m 
maps that part of every horizontal chord to the left of m exactly onto the part 
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to the right of m. Since reflection preserves length, these two parts must be the 
same length; in other words, m bisects each horizontal chord, as required. 

What about the Conjugate Diameters Theorem for the special case of the 
circle? 


a” ~~ a 
— . ™ 
aN mn 
fo 
jo 
\ .  / 
Qe ef 
—S—_” 


Start with the horizontal diameter £, and carry out the construction of another 
diameter as in Theorem 2; this yields the vertical diameter m. If we then start 
with the vertical diameter m and repeat the construction, we obtain @, the hori- 
zontal diameter of the circle. So Theorem 3 certainly holds when the ellipse is 
a circle. 


Generalizing the Proof 


We now investigate how the proofs of Theorems 2 and 3 for the circle can be 
turned into proofs for any kind of ellipse. The crucial fact is as follows. 


Theorem 4 Given any ellipse, there is a parallel projection which maps 
the ellipse onto a circle. 


A suitable parallel projection is illustrated below. Here the plane z (initially 
parallel to zr2) has been tilted about the minor axis of the ellipse. Under the 
projection distances which are parallel to the minor axis remain unchanged, 
but distances parallel to the major axis are scaled by a factor which depends on 
the ‘angle of tilt’. By choosing just the right amount of tilt we can ensure that 
the image of the major axis is equal in length to the image of the minor axis, 
thereby ensuring that the image of the ellipse is a circle. 


Ty Wy) 


ellipse 


Both Theorems 2 and 3 may now be proved using the following technique. 
First, map the given ellipse onto a circle, using a suitable parallel projection p. 
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An algebraic proof of a 
related theorem is given in 
Theorem 1 of 

Subsection 2.5.1. 


Algebraically, in terms of 
a suitable coordinate 
system, the mapping 
xh -x, yRhy, 
a 


maps the ellipse 


to the circle x? + y? = b?. 
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Since we have seen that the theorems hold in the case of the circle, we then map 

the circle back to the ellipse, using the inverse parallel projection p~!. Now 

collinearity and parallelism are preserved under a parallel projection, as is the Here we are using 
property of being the midpoint of a line segment, so the above two theorems, Properties 1, 2 and 3 of 


which hold for a circle, must hold also for the ellipse. parallel projections, in a 
crucial way. 


Notice that certain properties of figures, such as length and angle, are not 
preserved under a parallel projection. This is one difference between Euclidean 
geometry and affine geometry. The difference arises because the group of 
affine transformations is larger than the group of Euclidean transformations. 
In general, the larger the group that is used to define a geometry, the fewer 
properties the geometry has. 


Affine Transformations and Parallel Projections 


Earlier we mentioned that a parallel projection is a special type of affine 
transformation. We now show why this is indeed the case. 
First, consider a parallel projection p of a plane z onto a plane 2. For the 
moment, suppose that the planes are aligned so that the origin in 7; is mapped 
to the origin in 72. Since ratios of lengths are preserved along a straight line, Property 3, 
we must have, for any vector v € IR? and any A € R, Subsection 2.2.2 


p(Av) = Ap(y). (1) 


Next, let v and w be two position vectors in 7r;. Their sum, v + w, is found 
from the Parallelogram Law for addition of vectors, as shown in the diagram 
below. The images under p in 72 are p(v) and p(w), and the sum of these two 
vectors is p(v) + p(w). 
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But a parallel projection maps parallel lines onto parallel lines, so it must 
map parallelograms onto parallelograms. Hence it must map the parallelogram 
in 7 onto the parallelogram in z2, and, in particular, it must map v + w to 
D(v) + p(w). We may write this as 


p(v + Ww) = p(v) + p(w). (2) 


It follows from equations (1) and (2) that p must be a /inear transformation of 
R? onto itself. 
Hence there exists some matrix A such that for each v € R?, 


p(v) = Av. (3) 


Since the linear transformation p is invertible, it follows that A is invertible. 

Now suppose that the parallel projection maps the origin in 77 to some point 
B with position vector b in 72, as shown below. If we temporarily construct a 
new set of axes in zr2 that are parallel to the original axes, but which intersect at 
the point B, then with respect to these new axes p(v) = Av for some invertible 
matrix A, as before. To express p(v) with respect to the original axes, we 
simply add on the vector b to obtain 


p(v) = Av+b (4) 
for some invertible 2 x 2 matrix A. 


U3) 


e 


It follows from equation (4) that p must be an affine transformation. 
Theorem 5 Each parallel projection is an affine transformation. 


The converse is false, for it is not true that every affine transformation can 
be represented as a parallel projection. 
For example, consider the so-called ‘doubling map’ of R? to itself given by 


t(v) =2v (ve R’). (5) 
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Remember that 71 and 79 
represent copies of R2. 
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This is an affine transformation, since it can be written in the form f(x) = 
Ax+b with A = 21 and b = 0. However, a parallel projection is either between 
two parallel planes, in which case all lengths are unchanged, or between two 
intersecting planes, in which case distances along the line of intersection are 
unchanged. The doubling map has neither of these properties and so is not a 
parallel projection. 


Observation An affine transformation is not necessarily a parallel 
projection. 


Although the doubling map is not a parallel projection, it is possible to dou- 
ble lengths in R* by following one parallel projection by another: the first 
doubles all horizontal lengths, and the second doubles all vertical lengths. Thus 
the doubling map (5) can be represented as the composition of two parallel 
projections. 

We end this subsection by showing that every affine transformation can be 
expressed as a composition of two parallel projections. 

Recall that any affine transformation t : R? — R? has the form 


t(x) =Ax+b (xeR?’), (6) 


where A is an invertible 2 x 2 matrix. Now, ¢ is not a linear transforma- 
tion unless b = 0, but we can use methods similar to those for linear 
transformations to determine A and b. 

First, it follows from equation (6) that (0) = b; so b is the image of the 
origin under ¢. If we let e and f be the coordinates of (0), then we can write 


a=(0 4) = (4) 


where a, b, c, d are real numbers that have yet to be found. It follows from 
equation (6) that the images under f of the points (1, 0) and (0, 1) are given by 


(2 a)(o)+G)=()+G) 
(2 a) (0) +G)=(G) +5). 


So if, in addition to t(0) = (e, f), we know the points onto which (1, 0) and 
(0, 1) are mapped by tf, then we can determine the values of a,b,c and d. 
Indeed, we have 


(a,c) =t(1,0)—(e, f) and (b,d) =1(0,1)—(e, f). 


and 


It follows that an affine transformation is uniquely determined by its effect 
on the three non-collinear points (0, 0), (1, 0) and (0, 1). We shall return to this 
method of determining affine transformations in Section 2.3. 
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Notice that for an affine 
transformation f, the 
images ¢(1,0), ¢(0, 1) and 
t(0,0) = (e, f) cannot be 
collinear, for if they were, 
then (a, c) and (b, d) 
would be linearly 
dependent, and A would 
not be invertible. 
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So suppose that a given affine transformation t maps the points (0, 0), (1, 0) 
and (0, 1) to three non-collinear points P, Q and R, respectively. In order to 
express f as the composition of two parallel projections p; and p2, we need to 
define p; and p2 in such a way that p2 o p, has the same effect as t on (0, 0), 
(1, 0) and (0, 1). To do this, we first define p; so that it maps (0, 0) to P, (1, 0) 
to Q, and (0, 1) to some point X, say, and then define p2 so that it maps X to 
R while leaving P and Q fixed. 


(0, 1) 


To construct py; we embed its domain plane 1, and its codomain plane z., 
into R? so that the point (0, 0) in 2; coincides with the point P in zr, as shown 
below. It does not matter how this is done, provided that (1, 0) does not lie in 
zt. We then define p; by the family of rays that are parallel to the ray through 
the point (1, 0) in 7) and the point Q in 2. When defined in this way, p, maps 
(0, 0) to P, (1, 0) to Q, and (0, 1) to some point X, as required. 


To construct p2 we embed its domain plane z, and its codomain plane 2, 
into R? so that the points P and Q in coincide with the points P and Q in 
tz, as shown below. Again it does not matter how this is done, provided that 
X does not lie in 72. We then define p2 by the family of rays that are parallel 
to the ray through the point X in z and the point R in 22. Then p2 leaves P 
and Q fixed and maps X to R. 

Overall, the composite p2 o p; of the two parallel projections maps (0, 0), 
(1, 0) and (0, 1) to P, Q and R, respectively. Now p, and pz are affine transfor- 
mations, so p20 p, is also an affine transformation. Furthermore, p20 pj maps 
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Uniqueness then 
guarantees that 


t= p2° pi. 


For clarity, we have 
omitted the axes from the 
plane zr. 


See the figure below. 
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(0, 0), (1, 0) and (0, 1) to the same points as does t. Since such affine transfor- 
mations are unique, it follows that t = p20 p;. We have therefore demonstrated 
the following result. 


Theorem 6 An affine transformation can be expressed as the composite 
of two parallel projections. 


An important consequence of this theorem is that all properties of figures 
that are unchanged by parallel projections must also be unchanged by affine 
transformations. In particular, the three properties of parallel projections that 
we met in Subsection 2.2.2 must, in fact, be affine properties. 


2.3 _ Properties of Affine Transformations 


In the previous section you saw how parallel projections can be used to explore 
affine geometry from a visual point of view. In this section we explore some of 
the same ideas from an algebraic point of view. 


2.3.1 Images of Sets Under Affine Transformations 


We begin by describing how to find the image of a line under an affine 
transformation. To do this, recall that an affine transformation is a mapping 
t : R* — R? given by a formula of the form 


t(x) = Ax+b, (1) 


where A is an invertible 2 x 2 matrix. The set of such transformations forms a 
group, in which the transformation inverse to ¢ is given by 


t!(x) = AT!x— A7'b. (2) 
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Subsection 2.2.1 
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When equations (1) and (2) are used to find images under f, it is easy to 
confuse points in the domain plane with points in the codomain plane, as both 
planes are copies of R*. To avoid such confusion, we often reserve the symbol 
x and the coordinates (x, y) for points in the domain of ¢, and use the symbol 
x’ and the coordinates (x’, y’) to denote the image of x under f. 

With this notation, we may rewrite equations (1) and (2) in the form 


x’ = Ax+b, (3) 
x=A7'y’—Aq'b, (4) 


The next example illustrates how these equations can be used to find the 
image of a line under an affine transformation. 


Example 1 Determine the image of the line y = 2x under the affine 


transformation 
(4 1 2 2 
nix) = (5 i)x+(_1) (xeR’). (5) 


Solution Let (x, y) be an arbitrary point on the line y = 2x, and let (x’, y’) 
be the image of (x, y) under t. Then 


G)=@ JG)+() 


Next we use equation (4) to express (x, y) in terms of (x’, y’). We have 


41 -1 fal 1 _d 2 3 
( ) = ( 7 = and ( 2 2 ( ) — ( 4 ; Recall that the inverse of 
2 1 —1 2 —l 2 =I —4 the invertible matrix 


A= (* i) 


yA 


ay 


Ne) c od 


A all / _3 1 
(4 YO) = 
y —l 2 y 4 ad — be 


d —b 
It follows that under the inverse mapping t~! we have x ( = ) : 


X= 5x — sy -3 and y=—x’+2y' +4. 
Since x and y are related by the equation y = 2x, it follows that x’ and y’ are 


related by the equation 
ax! $2y'+4=2($2'— = 3), 
which simplifies to 
2x’ — 3y' = 7. 


Dropping the dashes, we see that the image of the line y = 2x under f is the 
line 


2x —3y =7. 
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Problem 1 Determine the image of the line 3x — y + 1 = 0 under the 


affine transformation 
1 _3 
. x+ : (x E R’) : 
2 4 


1 
t(x) = ( 2 
-1 


Problem 2 Determine the image of the circle x* + y* = 1 under the 


affine transformation 
1 _3 
f x+ ‘ (x €E R’) ; 
2 4 


1 
t(x) = ( ? 
-1 


The same technique can be used to find the images of other types of figures, 
such as other conics. You will meet some examples of this in Section 2.5. 


2.3.2 


The algebraic approach can also be used to investigate whether there is an 
affine transformation which maps one given figure onto another. Recall that 
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if there is such a transformation, then the two figures are said to be affine- 
congruent. This concept of congruence is important because, as we explained 
in Section 2.1, figures that are affine-congruent to each other share the same 
affine properties. 

In this subsection we prove the remarkable result that all triangles are affine- 
congruent and therefore share the same affine properties. In fact, since a 
triangle is completely determined by its three vertices, the congruence of tri- 
angles follows from the so-called Fundamental Theorem of Affine Geometry 
which states that any three non-collinear points can be mapped to any other 
three non-collinear points by an affine transformation. 

First, recall that in Subsection 2.2.3 we described how the points (0, 0), 
(1, 0) and (0, 1) in R? can be mapped to any three non-collinear points P, Q 
and R by an affine transformation. This transformation is unique in the sense 
that it is completely determined by the choice of P, Q and R. The following 
example should remind you of how such transformations are constructed. 


Example 2 Determine the affine transformation which maps the points (0, 0), 
(1, 0) and (0, 1) to the points (3, 2), (5, 8) and (7, 3), respectively. 


Solution Let t be the affine transformation given by 


eGe dG): y 


Since ¢(0,0) = (3, 2), it follows from (6) that e = 3 and f = 2. 
Next, (1,0) = (5, 8), so it follows from (6) that 


(3) = (6 a) (o)*@)=()+Q) 
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This is very different to 
Euclidean geometry, 
where two triangles are 
congruent only if they 
have the same shape and 
size. 


There the mapping was 
constructed in a geometric 
manner. In this subsection 
we construct the mapping 
algebraically. 
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The first column of the matrix for ¢ is therefore 


(‘)=(s)- (2) = (6). 


Finally, (0, 1) = (7,3), so that 
7\ (a b 0 ns 3\  (b xe 3 
3) \e d 1 2) \d 2) 
The second column of the matrix for f is therefore 
b\ (7 = 3\ (4 
d) \3 2) \isye 


Hence the desired affine transformation is given by 
ped * View 2 4 x 4 3 
“\y 6 1 y Os hag 


In general, if we want to find an affine transformation ¢t of the form 


ee OGG) y 


which maps (0, 0) to p, (1, 0) to q and (0, 1) to r, then we must choose a, b, c, 
d,e and f so that 


p=t(0,0)=€, f), so (e, f) =p; 


q=710,0)=(4,c)+,f), so(ac)=q-p; 
r=1(0,1) = (b,d) +(e, f), so(b,d)=r-—p. 


Notice that any three points p, q and r uniquely determine a transformation 
t of the form (7), but ¢ is affine only if the matrix 


Ce) 


is invertible. Since the columns of A correspond to the vectors q—p and r—p, 
it follows that A is invertible only if the vectors q — p and r — p are linearly 
independent. That is, provided that p, q and r are not collinear. 

So if p, q and r are not collinear, then we can use the following strategy to 
find an affine transformation which maps (0, 0) to p, (1, 0) to q and (0, 1) tor. 


Strategy To determine the unique affine transformation f(x) = Ax+b 
which maps (0, 0), (1, 0) and (0, 1) to the three non-collinear points p, q 
and r, respectively: 


1. take b = p; 
2. take A to be the matrix with columns given by q — p andr — p. 
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Problem 3 Use the strategy to determine the affine transformation 
which maps the points (0, 0), (1, 0) and (0, 1) to the points (2, 3), 
(1, 6) and (3, —1), respectively. 


Problem 4 Use the strategy to determine the affine transformation 
which maps the points (0, 0), (1, 0) and (0, 1) to the points (1, —2), 
(2, 1) and (—3, 5), respectively. 


Notice that the inverse of the transformation in Problem 3 is an affine trans- 
formation which maps the points (2, 3), (1, 6) and (3, —1) to the points (0, 0), 
(1, 0) and (0, 1), respectively. So if, after applying this inverse, we apply the 
affine transformation in Problem 4, then the overall effect is that of a compos- 
ite affine transformation which sends the points (2, 3), (1, 6) and (3, —1) to the 
points (1, —2), (2, 1) and (—3, 5), respectively. 

In a similar way, we can find an affine transformation which sends any three 
non-collinear points to any other three non-collinear points. 


Theorem 1 Fundamental Theorem of Affine Geometry 
Let p, q, r and p’, q’, r’ be two sets of three non-collinear points in R?. 
Then: 


(a) there is an affine transformation t which maps p, q and r to p’, q’ and 
r’, respectively; 
(b) the affine transformation f is unique. 


Proof 


(a) Let t be the affine transformation which maps (0, 0), (1, 0) and (0, 1) to 
the points p, q and r, respectively, and let f2 be the affine transformation 
which maps (0, 0), (1, 0) and (0, 1) to the points p’, q’ and r’, respectively. 
Then the composite t = f2 of, ' is an affine transformation, and it maps p, 
q andr to p’, q’ and r’, respectively. 


4 , 4 ae qt 
bd) e 
er (0,1)¢@ 
ae @ ‘e- _ > 
(0, 0) (1, 0) e;' 
r ep 
bd | 
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We shall use the results of 
these two problems 
shortly, in Example 3. 


(2,3) (0,0) RH (1, —2) 
(1,6) (1,0) (2,1) 
(3, -1) (0,1) & (3,5) 


lk Qo, 
pr (0,0) p 
qh (1,0) q’ 


re Older 
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(b) Suppose that f and s are both affine transformations which map p, q and r 
to p’, q’ and r’, respectively, and let t; be the affine transformation defined 
in part (a). Then the composites ¢ ot; and s of are both affine transforma- 
tions which map (0, 0), (1, 0) and (0, 1) to p’, q’ and r’, respectively. Since 
an affine transformation is uniquely determined by its effect on the points 
(0, 0), (1, 0) and (0, 1), it follows that fot; = sof. 

If we then compose both f ot; and so ¢; on the right with i it follows 
that t = s. Thus the mapping f constructed in part (a) is unique. | 


Now suppose that we are given two arbitrary triangles AABC and ADEF. 
By the Fundamental Theorem there is an affine transformation which maps the 
vertices A, B, C to the vertices D, E,, F, respectively. Since this transformation 
maps straight lines to straight lines, it must map the sides of AABC to the sides 
of ADEF, so we have the following important corollary. This will be used 
extensively in Section 2.4. 


Corollary All triangles are affine-congruent. 


In order to find the affine transformation which maps one triangle, vertex 
to vertex, onto another triangle, we follow the strategy used in part (a) of the 
proof of the Fundamental Theorem. 


Strategy To determine the affine transformation t which maps three non- 
collinear points p, q and r to another three non-collinear points p’, q/ and 
r’, respectively: 


1. determine the affine transformation ¢; which maps (0, 0), (1, 0) and 
(0, 1) to the points p, q and r, respectively; 

2. determine the affine transformation t2 which maps (0, 0), (1, 0) and 
(0, 1) to the points p’, q’ and r’, respectively; 

3. calculate the composite t = fy 0 ft, a 


Example 3 Determine the affine transformation which maps the points (2, 3), 
(1, 6) and (3, —1) to the points (1, —2), (2, 1) and (—3, 5), respectively. 


Solution You have already seen in Problem 3 that the affine transformation 
t; which maps the points (0, 0), (1, 0) and (0, 1) to the points (2, 3), (1, 6) and 
(3, —1), respectively, is given by 


1) = a 4) s+ (5): 


89 


tors , 


(0,0) pS" p 
(0H q rh q 


ODerry 


Recall that the previous 
strategy explained how 1 
and f can be determined. 
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Also, in Problem 4 you saw that the affine transformation tf) which maps 
the points (0, 0), (1, 0) and (0, 1) to the points (1, —2), (2, 1) and (—3, 5), 
respectively, is given by 


wo-(S (2) 


Following the strategy, we need to find the inverse of t;. We have 
—- iy" f=4 41 
3. -4 ~\-3 -1 
-4 -1 2 
=3 =1 3 


so that the inverse of f; is given by 
—-4 -1 11 
-1 
ty @) = (73 “1 )x+( ay 


Thus the affine transformation which maps the points (2, 3), (1, 6) and 
(3, —1) to the points (1, —2), (2, 1) and (—3, 5), respectively, is given by 


and 


ll 
~~ 
fe 
Nowa 
ee 


t(x) = tot; |(x) 
ap (fh F\ga(" 
Se: i 9 
_f{1 —-4 —-4 -1 ih 11 ns 1 
“AB 4 —3 -l 9 —2 
7 e 3) n/a 1 
~ \\-33 -10 96 —2 
_ 8 3 ox —24 
~ \ 33 —10 94 )° 
Problem 5 Determine the affine transformation which maps the 


points (1, —1), (2, —2) and (3, —4) to the points (8, 13), (3, 4) and 
(0, —1), respectively. 


2.3.3 Proofs of the Basic Properties of Affine Transformations 


In Subsection 2.2.2 we used parallel projections to demonstrate that affine 
transformations have the following basic properties: they map straight lines 
to straight lines, they map parallel lines to parallel lines, and they preserve 
ratios of lengths along a given straight line. We now give algebraic proofs of 
these assertions. 
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Theorem 2 An affine transformation maps straight lines to straight lines. 


Proof 


ptia t(p) + A(Aa) 


L 1(t) 
Let ¢ be a line through a point with position vector p, and let the direction of £ 
be that of some vector a. Then 
£={pt+aa:A€ R}. 
Now let t : R? —> R? be an affine transformation given by 
t(x) = Ax+b. 
We can find the image under ¢ of an arbitrary point p + Aa on £ as follows: 
t(p+ Aa) = A(p+aAa)+b 
= (Ap+ b) +AAa 
= t(p) + AAa. 
So the image of ¢ is the set 
t(€) = {t(p) + AAa: A € R}, 


which is a line through ¢(p) in the direction of the vector Aa. Oo 


Theorem 3 An affine transformation maps parallel straight lines to paral- 
lel straight lines. 


Proof 


t(q) + A(Aa) 


t(p) + A(Aa) 


1(L5) 


1(L4) 


Let £; and 2 be parallel lines through the points with position vectors p and 
q, respectively, and let the direction of the lines be that of the vector a. Then 


2) ={ptdaa:rAeR} and 0) ={q+dAa:rA ER}. 
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As in the proof of Theorem 2, the images of ¢; and 2 under the affine 
transformation f(x) = Ax + b are the sets 


t(€;) = {t(p) +AAa:A eR} and t(l2) = {t(q) +AAa: A € R}. 


These sets are straight lines which pass through the image points ¢(p) and t(q), 
both in the same direction as that of the vector Aa. Hence the two image lines 
under f are parallel, as claimed. |_| 


Rather than prove that affine transformations preserve ratios of lengths along 
a given straight line, as in Property 3 of Subsection 2.2.2, we prove the follow- 
ing more general result illustrated in the margin. The original result follows 
because any line is parallel to itself. 


Theorem 4 An affine transformation preserves ratios of lengths along 
parallel straight lines. 


Proof We begin by examining what happens to the length of a line segment 
under an affine transformation. 


Let £ be a line through a point with position vector p, and let the direction of @ 
be that of some unit vector a. Then 


€={pt+adAa:ir€ R}. 
As in the proof of Theorem 2, the image of @ under the affine transformation 
t(x) = Ax + bis the line 
t(€) = {t(p) + AAa: A € R}. 


Now consider a segment of ¢ with endpoints p + A,a and p + Aga. Since a is 
a unit vector, the length of the segment is 


||(p + Aza) — (p + Aja)|| = |A2 — Ail: [lal] = |A2 — Ail. 


The image of the segment has endpoints t(p) + 4; Aa and ¢(p) + A2Aa, so the 
image of the segment has length 


|(t(p) + A2Aa) — (t(p) + A; Aa)|| = |A2 — Ai] - ||Aal]. 


So, in the process of mapping segments along £ to segments along f(£), lengths 
are stretched by the factor ||Aa||. Since this factor is the same for all segments 
which lie along lines parallel to a, it follows that the ratios of lengths along 
parallel lines are unchanged by f. 5 
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~) 


preserves ratio: 
AB/CD =A'B'/C'D' 


Recall that ||a|| means the 
length of a. 
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2.4 Using the Fundamental Theorem of Affine Geometry 
In this section we explain how the Fundamental Theorem of Affine Geometry 
can be used to deduce the fact that the medians of any triangle are concurrent 
from the special case that the medians of an equilateral triangle are concur- 
rent. We then use similar methods to prove the classical theorems of Ceva and 
Menelaus. 


2.4.1. The Median Theorem 


Let AABC be an arbitrary triangle in the plane. If you join the midpoint of each 
side of the triangle to the opposite vertex (these lines are called the medians 
of the triangle), these three lines appear to pass through a single point. In fact, 
no matter what triangle you choose, you find that its medians meet in a single 
point. 


Theorem 1 Median Theorem 
The medians of any triangle are concurrent. 


We can get some evidence that this theorem holds in general by looking first 
at a special case where a proof of the theorem is straight-forward — namely, 
when the triangle is an equilateral triangle. 

To do this, consider an equilateral triangle AABC, with medians AP, BQ and 
CR. Since AABC has sides of equal length, it must be symmetric about the line 
AP. Thus the point at which BQ meets CR must be symmetrically placed with 
respect to this line — that is, it must actually lie on the line AP. In other words, 
the lines AP, BQ and CR are concurrent if the triangle is equilateral. 

In order to show that the medians of an arbitrary triangle meet at a point, 
consider an arbitrary triangle AABC, and let P, Q and R be the midpoints of 
the sides BC, CA and AB, respectively. Next, choose a particular equilateral 
triangle AA’B’C’, and let P’, Q’ and R’ be the midpoints of the sides B’C’, 
C’A’ and A’B’, respectively. 


According to the Fundamental Theorem of Affine Geometry there is an 
affine transformation t which maps AABC onto AA‘B'C’. Moreover, since 
affine transformations preserve ratios of lengths along lines it follows that rf 
maps the mid-points P, Q and R to the mid-points P’, Q’ and R’, respectively. 
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These results are named 
after Giovanni Ceva 
(Italian mathematician, 
1647/48-1734) and 
Menelaus of Alexandria 
(Greek geometer, Ist 
Century AD). 


This technique of looking 
first to see whether a 
result holds in a special 
case is often useful. 


B Cc 


94 2: Affine Geometry 
From the above discussion we know that the medians of any equilateral 


triangle meet at a point, so in particular we know that A’P’, B’Q’ and C’R’ 
meet at some point X’, say, as shown on the right below. 


affine 


The trick now is to observe that ¢ has an inverse t~! which is also an affine 
transformation. This inverse maps the medians A’ P’, B’Q’ and C’R’ back to 
the medians AP, BQ and CR of the original triangle AABC. Since X’ lies on 
all three of the lines A’ P’, B’Q’ and C’ R’ it follows that t~! maps X’ to some 
point X which lies on all three of the lines AP, BQ and CR. In other words, the 
medians of AABC are concurrent. 
Since AABC is an arbitrary triangle we have proved the Median Theorem. 
The essence of the above proof is the fact that all triangles are affine- 
congruent. That powerful result enables us to prove theorems concerning the 
affine properties of triangles (such as collinearity, lines being parallel, and The basic affine properties 
ratios of lengths along a given line) following a standard pattern. First, we were listed in 
choose a particular type of triangle for which it is easy to prove the result. Subsection 2.2.1. 
Then, by asserting the existence of an affine transformation from that triangle 
to an arbitrary triangle, we deduce that the result holds for all triangles. 
This is the approach we shall use to prove the theorems of Ceva and 
Menelaus later in the section. 


2.4.2 Ceva's Theorem 


We now prove the following theorem due to Ceva. 


Theorem 2 Ceva’s Theorem 
Let AABC be a triangle, and let X be a point which does not lie on any of 
its (extended) sides. If AX meets BC at P, BX meets CA at Q and CX meets 
BA at R, then R 
Ak BP CO. 1 Q 
RB UPC OA a 


Proof According to the Fundamental Theorem of Affine Geometry there is an 
affine transformation ¢ which maps the points A, B, C to the points A’ = (0, 1), 
B' = (0,0), C’ = (1,0), respectively. This transformation maps the triangle 
AABC onto the right-angled triangle AA’B’C’, and it maps the point X to 
some point X’ = (u,v). 
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A“(0, 1 
A (0, 1) 
R R’ Q 
Q X“(u, V) 
B c 
f B(0,0) PP” c’(1, 0) 


Using coordinate geometry we can calculate the equations of the lines A’X’, 
B'X’, C’X’ and hence find the coordinates of the point P’ where A’X’ meets 
B'C’, of the point Q’ where B’X’ meets A’C’, and of the point R’ where C’X’ 
meets A’ B’. 

Starting with the point P’, we note that the line B’C’ has equation y = 0. 


Also, the line A’X’ has slope a so its equation is y — 1 = ie (x — 0). 


Hence, at the point P’ where the two lines meet, we must have y = 0 and 


y-l=4 7% = 0), 80: 
; ( u ) 
l-v’ : 


Similarly, at the point R’ we have x = 0, and y —-0 = 


R= (0 a ). 
l—-—u 


Finally, at Q’ we have x + y = land y = 2x, sox = 


pa u Vv 
Q -(4..4). 


Thus, using the coordinate formulas for calculating ratios we obtain These formulas are given 


AUR’ ee | 1 at the beginning of 
zs — YR YA" _ Tu = = Appendix 2 just above the 


Tee — 1), so 


u 
u+v 


Uv 
u+v~" 


and y = 
Hence 


RB yy —yr 0-7 —v °° Section Formula. 
BE sexy jeg =? u 
PO s¢—ap l= >,  1l=-a@=u 
and 
UV 
C'Q' yo-yo mo _v 
= = = =-. 
QA’ ya yo 1- a 
Hence 


A’R’ B’ P’ C’ Q’ 
R’'B’ : P’C’ : Q'A’ = 
Since t~! is an affine transformation, it preserves ratios along a line. It must 


therefore map P’, Q’, R’ back to the points P, Q, R in such a way that 
AR BP CO _ 


RB PC QA’ 


as required. 0 
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The next example illustrates how we can use Ceva’s Theorem to calculate 
certain unknown distances along the sides of a triangle. For the method to work 
correctly, it is important to remember that all the ratios in Ceva’s Theorem are 
signed ratios. Thus, if X lies inside the triangle, as in part (a) of the example, 
then all the ratios are positive. But if X lies outside the triangle, as in part (b), 
then two of the ratios will be negative. 


Example 1 


(a) In the figure on the left below, AR = 1, RB = 2, BP = 3, CQ = 2 and 
QA = 2. Calculate the distance PC. 

(b) For the figure on the right, AR = 1, AB = 3, PC = 1, CQ = 2 and 
QA = 2. Calculate the distance BC. 


Solution 


(a) By Ceva’s Theorem, we have 


AR BP CQ_.. 
SO, 
Le 
2.PC 2 
It follows that PC = 3. 
(b) By Ceva’s Theorem, we have 
en BE OES ote 
RB PC QA , 


so, 


1 BC+1 2 
4 1 2. 


It follows that BC = 3. 


Problem 1 


(a) Determine the ratio ee in the left diagram below, given that 


AR AQ 3 


RB OC 2 
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etermine the ratio >= 1n the middle diagram below, given that 
(b) Determine th io Gy in th iddle diagram below, given th 


AR 1 BP 2 
RB 2 PC” 7 
(c) Determine the ratio as in the right diagram below, given that 
BP 5 c 
—=- and cg =—7, 
PC 7 QA 


Ceva’s Theorem has the following converse, which can be regarded as a 
generalization of the Median Theorem to configurations where P, Q, R are 
not all midpoints of sides. 


Theorem 3 Converse to Ceva’s Theorem 
Let P, Q and R be points, other than vertices, on the (possibly extended) 
sides BC, CA and AB of a triangle AABC, such that 

AK BP CO. 


RB PC Cian ) 


Then the lines AP, BQ and CR are concurrent. 


Proof Let the lines BQ and CR intersect at a point X, and let the line AX meet 
BC at some point P’. It is sufficient to prove that P = P’. 
It follows from Ceva’s Theorem that 
AR BP C 
— ee = 1, (2) 
RB P’C QA 
Hence, from equations (1) and (2), we have 


BP _ BP 
Fo PC 
so that P and P’ must indeed be the same point. | 


Example 2 The triangle AABC has vertices A(1,3), B(—1,0) and C(4,0), 
and the points P(0,0), 0 (3,4) and R (—3,4) lie on BC, CA and AB, 
respectively. 


(a) Determine the ratios in which P, Q and R divide the sides of the triangle. 
(b) Determine whether the lines AP, BQ and CR are concurrent. 
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In the Median Theorem, 


AR BP 
Rp PC”? 
OF aj 

QA” 


so Theorem 3 generalizes 
the Median Theorem. 
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Solution 


(a) Using the coordinate formulas for calculating ratios, we obtain 


AR _xr-*%a _ 7-371 _, BP xp—xp O0+1 1 
RB a3=29° 2142.~ : PC xc—xp 4-0 4 
CO 80-86 524 4 (3) 
QA x4 —XQ 1-§ 5° 


so that P divides BC in the ratio | : 4, Q divides CA in the ratio 4 : 5 and 
R divides AB in the ratio 5: 1. 
(b) It follows from (3) that the product 


AR BP CQ_._ 1 4_ 
RE PC OA AS 


15 


so by the converse to Ceva’s Theorem the lines AP BQ and CR must be 
concurrent. 


Problem 2 The triangle AABC has vertices A(—1, 1), B(2,—1) and 
C(3,2), and the points P($,1), Q (2,4) and R(Z,—4) lie on BC, CA 
and AB, respectively. 


(a) Determine the ratios in which P, Q and R divide the sides of the 
triangle. 
(b) Determine whether the lines AP. BQ and CR are concurrent. 


2.4.3. Menelaus’ Theorem 


Ceva’s theorem is concerned with lines through the vertices of a triangle that 
meet at a point. We now use the Fundamental Theorem of Affine Geometry to 
prove an analogous theorem due to Menelaus which is concerned with points 
on the sides of a triangle that are collinear. 


Theorem 4 Menelaus’ Theorem 
Let AABC be a triangle, and let @ be a line that crosses the sides BC, CA, 
AB at three distinct points P, Q, R, respectively. Then 


AR BP CQ _ , a 
RBI OPC OA Bi B c 


Proof According to the Fundamental Theorem of Affine Geometry there is 
an affine transformation t which maps the points A, B, C to the points A’(0, 1), 
B’(0,0), C’(1, 0), respectively. This transformation maps the triangle AABC 
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onto the right-angled triangle AA’B’C’, and it maps the line £ to some line ¢’. 
Let the equation of €’ be y = mx-+c. 


affine 
‘i A“(0, 1) 
A 
R’ 
R F 
B P 
c B’(0, 0) c’(1, 0) 


We now calculate the coordinates of the points P’, Q’ and R’ where ¢’ meets 
the sides B’C’, C’A’ and A’B’, respectively. 
At P’ we have y = 0 and y = mx-+-c. This implies that x = — ©, and hence 


F e 
P’ = (-=,0). 
m 
At R’ we have x = O and y = mx + c. This implies that y = c, and hence 
R’ = (0, c). 


At Q’ we have x + y = | and y = mx-+c. This implies that 1 — x = mx+c 


so that x = +=; also y = m(1 — y) +c, so that y = rr; and hence 


m+1 


Q' = l-—c m+c 
~\m+lmti)° 


Using the coordinate formulas for calculating ratios we obtain 


A'R' _ yr'—yar _e-1_ c—1 


R'B’ "yp — yr ~ O0-c —c 


BP’ ap 4a ee =e 
PIC’ xq —xp 148 mt+c 


and ; 
co XQ’ — XC at = —(m +c) 
OA xar—xq ORF a4 c-1 
Hence, 


A'R’ B'P’ CQ 
RB PIC Q'Al 
Since t~! is an affine transformation, it preserves ratios along a line. It must 
therefore map P’, Q’, R’ back to the points P, Q, R in such a way that 
AR BP CQ 
RB PC OA 


as required. | 


1. 


1, 


Remark 


As for Ceva’s Theorem, it is important to remember that all the ratios in 
Menelaus’ Theorem are signed ratios. In fact if £ passes through the interior of 
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the triangle, then precisely one of the ratios is negative; otherwise all the ratios 


are negative. 


Example 3 

(a) In the figure on the left below: AR = 1, RB = 2, BC = 2, CQ = | and 
QA = 1. Calculate the distance PC. 

(b) In the figure on the right below: AR = 2, AB = 1, BC = 2, CA = 2 and 
BP = 2. Calculate the distance QA. 


Solution 

(a) By Menelaus’ Theorem, we have 
AR BP CQ 
RB PC OA 


1 2+PC a 
2 PC 1 - 


It follows that 2 + PC = 2PC, and hence PC = 2. 
(b) By Menelaus’ Theorem, we have 


AR BP CQ _ 
RB PC QA 


(-3)-Ca)-gr) = 


It follows that 2+ QA = 3QA, and hence QA = 1. 


So 


1. 


Problem 3 
(a) Determine the ratio re in the left diagram below, given that 


AR BP 
—=2 and —=-2. 
RB PC 


(b) Determine the ratio cO in the right diagram below, given that 
OA g & g 


AR 1 BP 
and 


== —_ = —2, 
RB 4 PC 
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Menelaus’ Theorem has a converse that enables us to check whether points 
on the three sides of a triangle are collinear. 


Theorem 5 Converse to Menelaus’ Theorem 
Let P. Q and R be points other than vertices on the (possibly extended) sides 
BC, CA and AB of a triangle AABC, such that 


AR BP C 
EY TT = (3) 
RB PC QA 

Then the points P Q and R are collinear. 


Proof Let the line ¢ that passes through Q and R meet BC at some point P’. 
It is sufficient to prove that P = P’. 
It follows from Menelaus’ Theorem that 
AR BP CQ _ 
RB P'C QA 


tt, (4) 


Hence, from equations (3) and (4) we deduce that 


BP _ BP 
PC PC 
It follows that P and P’ must indeed be the same point. | 


Problem 4 The triangle AABC has vertices A(2,4), B(—2,0) and 
C(1,0), and the points P(3,0), Q(3,2) and R(1,3) lie on BC, CA and 
AB, respectively. 


(a) Determine the ratios in which P, Q and R divide the sides of the 
triangle. 
(b) Hence determine whether the points P, Q and R are collinear. 


We end this subsection with two revision problems. 


Problem 5 Let AABC be a triangle, and let X be a point which does 
not lie on any of its (extended) sides. Also, let AX meet BC at PR. BX meet 
CA at Q and CX meet BA at R; and let QR and BC meet at T. 

Given that ae = k, determine We in terms of k. 
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The strategy of the proof 
is the same as that of 
Theorem 3. 

A 
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Problem 6 Suppose that P and Q are the midpoints of the sides AB 
and BC of a parallelogram ABCD, and that the lines DP and AQ meet 
at R. 


(a) Determine the image of B under the affine transformation t which 
maps A, D and C to (0, 1), (0, 0) and (1, 0), respectively. 

(b) By considering the image of ABCD under t, determine the ratios 
PR: RDand AR: RQ. 


2.4.4 


In this subsection we introduce a new coordinate system, of barycentric coor- 
dinates with respect to a triangle of reference, for points in the plane, which can 
simplify some calculations. Rather than use two perpendicular axes to deter- 
mine the coordinates of an arbitrary point in the plane, we use a weighted sum 
of the coordinates of three non-collinear points — the vertices of the triangle of 
reference. 


Barycentric Coordinates 


Definitions Let A = (a),a2), B = (bj,b2) and C = (cj,c2) be three 
non-collinear points in the plane R*; we will call AABC the triangle of 
reference. Then a point (x, y) in the plane has barycentric coordinates 
(€,7,¢) with respect to AABC if 

= Gi PH aE ECs 

y = §az + nbz + $c, and (5) 

lSe 49 4ee&. 


Remark 


In particular, notice that the barycentric coordinates of the vertices A, B 
and C of the triangle of reference AABC are (1,0,0), (0,1,0) and (0,0,1), 
respectively. 


Thus, for example, if the triangle of reference AABC has vertices A = (1,2), 
B = (—3,0) and C = (—2,4), then the point with barycentric coordinates 


(3 3, = 1) has cartesian coordinates 


(4-0) +3-€3)-4-2),4-@+3- 


_ (2-942 440-4 
- 4° 4 


— 
| 
Ale 
A 
ad 
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Barycentric coordinates 
were introduced by 
MObius in 1827. 


A(a1,d) 


B(b,,b9) 


C(c},¢2) 


Cartesian coordinates are 
the standard Euclidean 
coordinates in the plane. 
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We may express the formula (5) for barycentric coordinates in terms of 
matrices in the form 


x E aq by cy 
y}=MIn]}], whereM=Ja bo co]. (6) 
1 6 t f. 4 


Since A = (a1, a2), B = (bj, bz) and C = (c},c2) are non-collinear, the deter- 
minant det M is non-zero. Then, since M is non-singular, we can reformulate We do not prove that 


the representation (6) as det M # 0 here. 
é x 
n}=M'|y].- (7) 
a 1 


Example4 Determine barycentric coordinates for the point (2,1) with respect 
to the triangle of reference AABC where A = (1,0), B = (1,—1) and 
C=(-1,1). 


Solution The matrix M for the triangle of reference AABC is 


1 1 -l 
M={0 -1l 1], 
1 1 1 
whose inverse is We omit the details of the 
I it 0 calculation of this inverse. 
1 1 
=3 =1 4 
1 1 
—z 0 3 


It follows from the representation (7) that the point (2,1) has barycentric 
coordinates with respect to the triangle of reference AABC given by 


1 1 0 3 
1 1 

—z ~l xz]/Ji)=|-3 1]; 
1 1 

-5 0 5 1 = 


namely, barycentric coordinates (3, - 3, _ 3). 


Problem 7 Determine barycentric coordinates for the point (—1, 1) 
with respect to the triangle of reference AABC where A = (1,1), B= 
(2,2) and C = 1,2. 


Next, we give barycentric versions of the condition for collinearity of three 
points in the plane and of the equation of a line in the plane. 
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Theorem 6 The points PR Q and R with barycentric coordinates 
(E1,71, 01), (2,2, 62) and (&3, 3, £3) are collinear if and only if 


& & 
m 2 n2| = 0. 
Gi GG 


Proof Let the points P Q and R have cartesian coordinates (x1, y1), (x2, y2) 
and (x3, y3), respectively. It follows that, if the triangle of reference AABC has 
vertices A = (a},da2), B = (bj, b2) and C = (c1,c2), then we may apply the 
formula (5) to each of P. Q and R in turn to obtain 

x) = 4181 + bing + cr) and yy = ag&) + bam + €261, 

X2 = a& + bing + 1b and yo = a2 + bao + c262, 

x3 = a1&3 + bins + c163 and y3 = a2&3 + bon3 + €263. 


We may write these simultaneous equations in matrix form as 


Xt X2 x3 aq b cy & & & The results for P Q and R 
yo v wf =la bh oc nm 2 1 ae out sab antaeie : 2 
and 3, respectively, of the 
: : : : : i S162 $3 left-hand matrix. 
& & & 
=Mjim m mm], (8) 
foo & 
ay bh cy 
whereM=]|a2. bo co 
1 1 1 


Now, M is non-singular since the points A, B and C are non-collinear. Then 
it follows from equation (8) that 


&i & & xX, x2 x3 
nm n2 3} =Oifandonlyif jy, yo y3)=0. 
fi $2 1 1 1 


But the second determinant equation here is the condition for P. Q and R to be 
collinear. The desired result then follows immediately. 5 


Corollary The line ¢ through the points with barycentric coordinates 


(1,1, 61) and (&2, 72, 2) has equation That is, an equation in 
terms of barycentric 
Slee ae coordinates. 


im ap | =O. 
Gl Ge & 
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For example, if the triangle of reference is AABC, the equation of the line 
AB in terms of barycentric coordinates is 


which simplifies to the equation ¢ = 0. 


Problem 8 Determine which of the following sets of points, described 
by their barycentric coordinates with respect to an unspecified triangle 
of reference, are collinear; for those that are collinear, determine the 
equation of the line on which they lie. 


(a) G1,1,=1), 4,—2,=1), (3,2.-3) 
(b) (1,1,—1), (2,—2, 1), (—1, 7, -5) 


Next, we meet a version of the Section Formula in terms of barycentric 
coordinates. 


Theorem 7 Section Formula 

The point R that divides the line @ joining the points P and Q with barycen- 
tric coordinates (&,71,¢,) and (&, 72,2) in the ratio (1 — 4) : A has 
barycentric coordinates 


(3775) = XEin ni. Gi) — ) (Ens 425 @). 


Proof Let P, Q and R have cartesian coordinates (x1, y1), (x2, y2) and (x, y), 


respectively. It follows from the cartesian form of the Section Formula that 
(x,y) = A(x, y1) + A — A) (x2, y2), for some real number A; 


we can rewrite this equation in matrix form as 


+] x2 
yJ=At mum |+a-A)] 
1 1 1 


Multiplying both sides of this equation on the left by the matrix M~! and using 
the formula (8), it follows that 


§ §| &o 
ny) =A;lm | +Qd—-A){ m 
g f1 b2 
This is the desired result. |_| 


We can now use barycentric coordinate methods to give further proofs of 


Menelaus’s Theorem and Ceva’s Theorem. 
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Similarly, BC and CA 
have equations € = 0 and 
n=0. 


The Section Formula is 


given in Appendix 2. 
A 
1-A Q 
R 
P 
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Theorem 8 Menelaus’ Theorem 
Let AABC be a triangle, and let @ be a line that crosses the sides BC, CA 
and AB at three distinct points P Q, R, respectively. Then 


AR BP CQ apes 
ieee ee 
Proof Define i, ~ and v as follows: 
BP =A CQ i1-p AR I=» 
PC 2? OCA w’” RB v 


Hence, by the Section Formula, P has barycentric coordinates 
P =A(0,1,0) + (1 — 4)(0,0, 1) 


(0. k= a), 


Similarly, Q and R have barycentric coordinates (1 — jw, 0, uw) and (v, 1 — 
v, 0), respectively. 
Then, by Theorem 6 the points P. Q and R are collinear if and only if 


Expanding this determinant, we have that P, Q and R are collinear if and only if 


Xr 0 
v 
1-rA pw 


l-—v 


1-2 0 


=(h= 7) 


that is, if and only if 
(—Ay( — wd — v) + Apv = 0, 


or 


From the original definition of 4, ~ and v, it follows that P Q and R are 
collinear if and only if 
AR BP CQ = 
RB PC QA — 


1. 


From our assumption that P Q and R are collinear, the desired result 
follows. |_| 
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For, P divides BC in the 
ratio (l—A):A. 


We omit the details of 
these calculations. 


Notice that we have also 
obtained a proof of the 
converse of Menelaus’ 
Theorem, since this 
equality is an ‘if and only 
if’ result! 
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Theorem9 Ceva’s Theorem 

Let AABC be a triangle, and let X be a point which does not lie on any of 
its (extended) sides. If AX meets BC at P BX meets CA at Q and CX meets 
BA at R, then 


AR BP CQ | 


RB PC OA 


Proof We use the same notation 4, jz and v as in the previous proof, so that 
again P. Q and R have homogeneous coordinates (0,4, 1 — A), (1 — yw, 0, ) 
and (v, | — v,0), respectively. 

Then, using the Corollary to Theorem 6 the equation of AP is 


1 0 E 
O 42‘ n | =0, 
0 1-A ¢ 
which simplifies to 
A 0 
toe Sl 


This gives that the equation of AP is A¢ — (1 — A)n = 0, or 


pe 1-i 
= "a n. 
Similarly, we find that the equation of BQ is € = THe, 

Since AP and BQ meet at X, it follows that the barycentric coordinates 
(€,,¢) of X must satisfy the equations ¢ = An and € = THe; hence 
its barycentric coordinates are 

(- —-r 1l-p 1-2 


. =n , for some 0. 
; reece ") nF 


Then C, X and R are collinear if and only if 


1-r |l-wp 


0 a han n 
0 n 1-—v |=0; 
( 0 

my 


this simplifies to 
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We omit the details of this 
calculation. 
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It follows that C, X and R are collinear — that is, that AP BO and CR are 
concurrent — if and only if 


AR BP CQ _ 

RB PC QA 
From our assumption that AP BQ and CR are concurrent, the desired result 
follows. |_| 


2.5 Affine Transformations and Conics 


2.5.1 Classifying Non-Degenerate Conics in Affine Geometry 


In Section 2.2 you saw that under an affine transformation a straight line maps 
to a straight line. Indeed, it follows from the Fundamental Theorem of Affine 
Geometry that any straight line can be mapped to any other straight line by 
some affine transformation. We now explore the corresponding situation for 
conics. 

Recall that a conic is a set in R* given by an equation of the form 


Ax’ + Bry + Cy’ + Fx+ Gy +H =0, (1) 


where A, B, C, F, G and H are real numbers, and A, B and C are not all zero. 
The three types of non-degenerate conic are ellipses, parabolas and hyperbo- 
las. A non-degenerate conic is an ellipse if B* — 44C < 0, a parabola if 
B* — 4AC = 0, and a hyperbola if B* — 4AC > 0. 

First, consider the case where equation (1) represents an ellipse, as illus- 
trated on the left of the figure below. We can apply a translation to move the 
centre of the ellipse to the origin, and then a rotation to align its major and 
minor axes with the directions of the x-axis and y-axis, respectively. After we 
have applied these two Euclidean transformations, the equation of the ellipse 
becomes 

vy? 
a a>b>0. (2) 
If we now apply the affine transformation ft, : (x, y) F (x’, y’), where 


C=C ia) G) 
Y) \ 0 tbo} \y)’ 
then x’ = x/a and y’ = y/b, so equation (2) becomes 


(+O =1. 


2m translation rotation 
a} —~ 
_ > 
% x 
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Notice that we have also 
obtained a proof of the 
converse of Ceva’s 
Theorem, since this 
equality is an ‘if and only 
if’ result! 


We discussed conics in 
Chapter 1. 


You met the B2 — 4AC 
test for conics in 
Theorem 3 of Section 1.3. 
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Since the translation, the rotation and the transformation f; are all affine, 
their composite must also be affine. Overall, this shows that each ellipse can 
be mapped onto the unit circle by an affine transformation. We therefore have 
the following theorem. 


Theorem 1 Every ellipse is affine-congruent to the unit circle with 
equation x? + y* = 1. 


Secondly, consider the case where equation (1) represents a hyperbola, as 
illustrated on the left of the figure below. Again, we can apply a translation to 
move the centre of the hyperbola to the origin, and then a rotation to align its 
major and minor axes with the directions of the x-axis and y-axis, respectively. 
After we have applied these two transformations, the equation of the hyperbola 
becomes 


——=1, (3) 
Under the affine transformation ft; defined above, equation (3) becomes 
@'yY- oY =1, 
that is, 
Q!-y)G@'+y)=1. (4) 
Finally, if we apply the affine transformation f2 : (x’, y’) > (x”, y”), where 
ay. fl = EA 
y” ~~ 1 1 y’ > 


then equation (4) becomes 


xy" = 1 
translation rotation ty , t 7 
y “o™ y ao y: a~ UY 
(LU | tz: > | Cc . wel | 1 \ 
Woo pS ZINT 
x 
Dropping the dashes from the equation x” y” = 1, we obtain the following 
theorem. 
Theorem 2 Every hyperbola is affine-congruent to the rectangular hyper- Recall that ‘rectangular’ 
bola with equation xy = 1. means that the asymptotes 


of the hyperbola are at 


: ; : . right angles to each other. 
Finally, consider the case where equation (1) represents a parabola, as illus- 


trated on the left of the figure below. We can apply a translation to move the 
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vertex of the parabola to the origin, and then a rotation to align its axis with the 
(positive) x-axis. After we have applied these two Euclidean transformations, 
the equation of the parabola becomes 


yey (5) 


where a is some positive number which depends on the coefficients in 
equation (1). 
Next, if we apply the affine transformation f3 : (x, y) +> (x’, y’), where 


x\ fifa 0 x 
vy} \ 0 Ta}X\y/J 
then x’ = x/a and y’ = y/a, so equation (5) becomes (ya)? = a(x’a), or 


(y’)? — x’. 


y translation  y rotation y bs y’ 
a Pee > a 
> > > 
x x Xx Y 


Dropping the dashes, we obtain the following theorem. 


Theorem 3 Every parabola is affine-congruent to the parabola with 
equation y* = x. 


Since all parabolas are affine-congruent to y?=x, they must be 
affine-congruent to each other. Similarly, by Theorem 1, all ellipses must 
be affine-congruent to each other; and, by Theorem 2, all hyperbolas must 
be affine-congruent to each other. 

This raises the question as to whether it is possible for one type of conic 
(such as an ellipse) to be affine-congruent to another type of conic (such as 
a hyperbola). The next theorem shows that this cannot happen. In fact, since 
an affine transformation can be expressed as the composite of two parallel 
projections, this should not surprise you. After all, no parallel projection can 
change a bounded curve (such as an ellipse) into an unbounded one (such as 
a parabola or a hyperbola); nor can it change a curve with two branches (a 
hyperbola) into a curve with just one branch (an ellipse or a parabola). 


Theorem 4 Affine transformations map ellipses to ellipses, parabolas to 
parabolas, and hyperbolas to hyperbolas. 


Proof Consider the non-degenerate conic with equation 


Ax? + Bxy + Cy* + Fx+ Gy + H =0, (6) 
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Here we omit the details 
of the particular 
transformations involved, 
and concentrate instead on 
the principles underlying 
the successive mappings 
which are used. 


Remember that a circle is 
a special type of ellipse. 
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and its image under an affine transformation f : x + x’ given by 
x’ = Ax+b, 


where A is an invertible 2 x 2 matrix. 
The inverse affine transformation t—! : x’ +> x is given by 


x =A7!x’—A7'p, 


which we may write in the form 


/ 
er a) 
y r s y v 
for some real numbers p, g, 7, 5, u and v. It follows that 


x=px+qy+u and y=rx+sy +v. (7) 


If we now substitute these expressions for x and y into equation (6), then the 
resulting equation is a second-degree equation in x’ and y’, so the image of the 
conic under the affine transformation ¢ must be another conic. 

Next we show that this image conic cannot be degenerate. A degenerate 
image would consist of a pair of lines, a single line, a point, or the empty 
set. Since the affine transformation t~! maps lines to lines, it would map the 
degenerate image to another degenerate conic. But this cannot happen since 
t~! maps the image back to the original non-degenerate conic (6). It follows 
that the image of (6) cannot be degenerate. 

Finally, if we substitute for x and y from equations (7) into equation (6), and 
keep careful track of the algebra involved, it turns out that the discriminant of 
the image conic is just 


(ps — rq)" (B? — 4AC). 


Here B? — 4AC is the discriminant of the original conic. Since (ps — rq)* > 0, 
the sign of the discriminant is not changed by an affine transformation of a 
conic. Hence the type of the conic is also unchanged. | 


We can combine the results of Theorems 1-4 to obtain the following 
corollary. 


Corollary In affine geometry: 


(a) all ellipses are congruent to each other; 
(b) all hyperbolas are congruent to each other; 
(c) all parabolas are congruent to each other. 


Non-degenerate conics are congruent only to non-degenerate conics of the 
same type. 
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You met this formula for 
the inverse in 
Subsection 2.2.1. 


We omit the details of 
these calculations, as they 
are complicated and 
uninformative. 


Theorem 2 of Section 2.3 


Recall that the sign of the 
discriminant of a non- 
degenerate conic 
determines the type of the 
conic. 


Here ps —rq #4 OsinceA 
is invertible. 


112 


The corollary shows that affine-congruence partitions the set of non- 
degenerate conics into three disjoint equivalence classes. One class consists 
of all the ellipses, another class consists of all the hyperbolas, and the third 
consists of all the parabolas. Each class contains one of the so-called standard 
conics x* + y* = 1,xy = land y* =x. 

Just as the Fundamental Theorem of Affine Geometry enables us to deduce 
a given result about an arbitrary triangle by showing that the result holds for an 
equilateral triangle, so the corollary enables us to deduce a given result about 
an arbitrary ellipse, hyperbola or parabola by showing that the result holds for 
the corresponding standard conic. Of course, this works only if the result is 
concerned with the affine properties of the conic, so we need to be able to 
recognize such properties. 

The following theorem shows that one such property is the property of being 
the centre of an ellipse or hyperbola. 


Theorem 5 Let ¢ be an affine transformation, and let C be an ellipse or 
hyperbola with centre R. Then #(C) has centre 7(R). 


Proof Let C’ and R’ be the images of C and R under ¢. If P’ is any point on 
C’, then it must be the image of some point P on C. Since R is the centre of 
C, we can rotate P about R through an angle z to a point Q which must also 
lie on C. Hence Q’ = t(Q) is a point on C’. 


t 


i 


Now f preserves ratios of lengths along lines, so the line segment PRQ maps 
onto the line segment P’R’Q’ with P’R’ = R’Q’. Thus if we rotate P’ about 
R’ through an angle zr, it must go to Q’ on C’. Now, as our choice for P’ as 
a point on C’ varies, so do P = t~!(P’) and Q, but the point R is always 
the same point. It follows that the midpoint of P’Q’ is always the same point 
R’ = t(R). Hence R’ = t(R) is the centre of C’, as required. | 


Another affine property is the property of being an asymptote of a hyperbola. 


Theorem 6 Let ¢ be an affine transformation, and let H be a hyperbola 
with asymptotes ¢; and £2. Then t(H) has asymptotes ¢(€;) and t(€2). 
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Recall that a parabola 
does not have a centre. 


This uses the definition of 
centre given in Chapter 1. 


This figure illustrates the 
proof for an ellipse C, but 
the proof works equally 
well for a hyperbola. 


For since PR/RQ = | it 
follows that 
P/R'/R'Q’ = 1. 
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The figure below illustrates that this theorem is plausible for parallel 
projections. 


Proof The hyperbola H possesses exactly two (distinct) families of parallel 
lines each of which fills the plane, with each member of each family meeting 
H exactly once — apart from one line in each family that is an asymptote of H, 
and so does not meet H. 

The image of H under the affine transformation ¢ is also a hyperbola, t(#). 
The images under f of the two families of parallel lines are also (distinct) fam- 
ilies of parallel lines; within each family, a line that meets H once is mapped 
onto a line that meets t( 1) once, and the single line that does not meet H maps 
onto a line that does not meet f(H). So the two exceptional lines in the image 
families must be the asymptotes of the hyperbola r(#7). 

It follows that the asymptotes of H are mapped by ¢ to the asymptotes of 
t(), as required. B 


Many of the problems concerning conics which are particularly amenable to 
solution using the methods of affine geometry involve tangents. 

This is due to the following theorem, which asserts that tangency is an affine 
property. 


Theorem 7 Let f be an affine transformation, and let @ be a tangent to a 
conic C. Then f(£) is a tangent to the conic t(C). 


The figure below illustrates the theorem for parallel projections. 


Solution We shall use the fact that a tangent to a conic (whether it is an 
ellipse, a hyperbola or a parabola) intersects the conic at exactly one point. 
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However we have to be a 
little careful. For example, 
any line parallel to its axis 
meets a parabola in 
exactly one point. 
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First, the image of an ellipse E under an affine transformation ¢ is an ellipse. 
A tangent to E is a line that intersects E in exactly one point. These properties This characterizes 
remain unchanged under an affine projection; hence the image of a tangent to _ tangents to ellipses. 
E under an affine transformation ¢ must be a tangent to r(£). 
Next, the image of a hyperbola H under an affine transformation ¢ is a hyper- 
bola. A tangent to H is a member of a family of parallel lines that fill the plane 
such that there are lines in the family that meet H twice, once and not at all; 
there are exactly two lines in the family that meet H exactly once, and these 
are tangents to H. The image of the family of lines under tis again a family of — This characterizes 
parallel lines that fill the plane; it contains lines that meet the parabola t(H) tangents to hyperbolas. 
twice and not at all, and exactly two lines that meet H exactly once. These lines 
are the images of the original tangents to H, and must themselves be tangents 
to t(H). Hence, the image of a tangent to H under an affine transformation ft 
must be a tangent to t(#). 
Finally, the image of a parabola P under an affine transformation f is a 
parabola. A tangent to P is a member of a family of parallel lines that fill 
the plane such that there are lines in the family that meet P twice, once and 
not at all; the tangent is the unique member of the family that meets P exactly This characterizes 
once. The image of the family of lines under ¢ is again a family of parallel _ tangents to parabolas. 
lines that fill the plane; it contains lines that meet the parabola t(P) twice 
and not at all, and a single line that meets P exactly once. This line is the 
image of the original tangent to P, and must itself be a tangent to t(P). 
Hence, the image of a tangent to P under an affine transformation f must be 
a tangent to t(P). 
This completes the proof. | 


In applications we often use the following facts that you met earlier. Theorem 2, 
Subsection 1.2.1. 


Tangents to Conics in Standard Form The equation of the tangent to a 
standard conic at the point (x1, y;) is as follows. 


Conic Tangent 

Unit circle x* + y* = 1 xxi + yy, =1 
Rectangular hyperbola xy = | xy, + yx, = 2 
Parabola y* = x DSN) 86 se I 


2.5.2 Applying Affine Geometry to Conics 


We are now in a position to apply the methods of affine geometry to the 

solution of problems involving conics. Of course, affine geometry can be 

helpful in this task only if the property being investigated is one which is We used these techniques 
preserved under affine transformations. The underlying idea is that we use in Subsection 2.2.3 to 

an affine transformation to map the original conic onto one of our standard — Prove the Conjugate 


: ' “ie Diameters Th fe 
conics, tackle the problem in hand there, and then map back to the original sheellipse ect 
conic. , 
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Example1 AB is a diameter of an ellipse. Prove that the tangents to the ellipse —_ Recall that the diameter 


at A and B are parallel to the diameter conjugate to AB. conjugate to AB is the set 
of midpoints of all the 


Solution First, map the ellipse onto the unit circle, by an affine transforma- chords parallel to AB (see 
tion t. Since the centre O of the ellipse maps to the centre O’ of the circle, the Subsection 2.2.3). 
image of the diameter AB is a diameter A’ B’ of the unit circle. 


All chords of the circle that are parallel to the tangents at A’ and B’ are bisected 
by A’B’, and so the diameter through O’ is the diameter conjugate to A’B’. 
Since parallel lines map to parallel lines and ratios along parallel lines are 
preserved under the inverse affine transformation t—!, it follows that all chords 
of the ellipse that are parallel to the tangents at A and B are bisected by AB, 
and so the diameter through O that is parallel to the tangents at A and B is the 
diameter conjugate to AB. 


Q 
Problem 1 An ellipse touches the sides BC, CA and AB of AABC at R 
the points P, Q and R, respectively. Prove that 
AR BP CQ _ ——y “ 
RB PC QA” 


and deduce that the lines AP, BQO and CR are concurrent. 


Problem 2 The tangents to an ellipse at two points A and B meet at a 
point 7: Prove that the line joining T to the centre O of the ellipse bisects 
the chord AB. 


The rectangular hyperbola H = {(x, y) : xy = 1} does not possess as much 
symmetry as does the unit circle; so the fact that every hyperbola is affine- A ae = 
congruent to H may not be sufficient to simplify a given problem. Fortunately, 
however, we can also arrange for any given point on the original hyperbola to 
map to the point (1, 1) on H. 
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To see this, note that for any non-zero number a, the affine transformation 


(5) ~ (¢ A, @ 


maps # to itself. For, an arbitrary point on H has coordinates of the form 
(x, 1/x), x A 0, and under fy this is mapped to the point (ax, 1/ax), which also 
lies on H. As x varies through R — {0}, its image (ax, 1/ax) varies over the 
whole of H, so the image of H under fg is the whole of H. 

So if we start with a given hyperbola and a point P on it, we can map the 
hyperbola to H by some affine transformation s. The point s(P) will then have 
coordinates (b, 1/b) for some number b € R — {0}; so if we choose a = 1/b, 
then the affine transformation fg will map s(P) to (1,1). Overall, the composite 
t = f, 08 is an affine transformation which maps the given hyperbola to H, and 
maps P to (1, 1). We now state this as a corollary to Theorem 2. 


Corollary Given any hyperbola and a point P on it, there is an affine trans- 
formation which maps the hyperbola onto the rectangular hyperbola xy = 1, 
and the point P to (1, 1). 


Example 2 The tangent at the point P on a hyperbola meets the asymptotes 
at the points A and B. Prove that PA = PB. 


4 t 


t(A) 


Solution Let t be an affine transformation which maps the hyperbola onto 
the rectangular hyperbola H = {(x,y) : xy = 1} in such a way that 
t(P) = (1, 1). Then, by Theorem 6 of Subsection 2.5.1, the asymptotes of the 
hyperbola map to the asymptotes of H; and, by Theorem 7 of Subsection 2.5.1, 
the tangent at P maps to the tangent at (1,1). 

By symmetry, (1, 1) is the midpoint of the line segment from f(A) to t(B). 
Since midpoints are preserved under the affine transformation t~!, it follows 
that P is the midpoint of AB. 


Problem 3 P is a point on a hyperbola H with centre O. Prove that 
there exists a line £ through O such that all chords of the hyperbola which 
are parallel to ¢ are bisected by OP. 
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This result is an analogue 
for the hyperbola of the 
Conjugate Diameters 
Theorem for the ellipse 
(Theorem 3 of 
Subsection 2.2.3). 


Exercises 


2.6 Exercises 


Section 2.1 


1. Let AABC be a triangle in which AB = AC. Prove that 


ZABC = ZACB. 


Hint: Consider a reflection in the bisector of ZBAC. 
2. Determine which of the following transformations t:R*—> R? are 


Euclidean transformations. 


1 V3 


Determine the composite 1, Pon. 


=5. 5 4 
t — 
@o=( 7 7 |x+(74) 
2 2 
2 1 
~3 3 3 
(b) w= (| 2)x+(2) 
3 3 
+ 2 > 
(c) t(x) = > ” x+(_3) 
V5 VS 
3. The Euclidean transformations ft; and f2 are given by 
= 2 I 
t(x) = . : x+ (~:) 
V5 V5 
and 
tL 2 2 
t2(xX) = oa . x+(_i). 
V5 5 
Determine the composites ft; 0 fz and f2 0 fy. 
4. Determine the inverse of each of the following Euclidean transformations. 
5 12 
un =4 
wro=(i «a+ 5 
3 13 
—-R _35 
13 13 
(b) 1(%) = i st (_:) 
13 3 
5. The Euclidean transformations ft and f2 are given by 
ae as 
n@=( oP | xt (_:) 
v2 V2 
and 
el. 25 I 
to(x) = - : x+({). 
V2 2 
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Section 2.2 


1. Determine whether or not each of the following transformations t : R* > 


R? is an affine transformation. 


2 —2 2 
@ 0) =(_5 ~3)x+(_1) 

> -=2 =3 
19 =(_5 ~$)x+(—) 


2 
-1 1 
(c) nix) = (| 28 


2. Write down an example (if one exists) of each type of transformation 


t : R* — R? described below. In each case, justify your answer. 

(a) An affine transformation t which is not a Euclidean transformation 
(b) A Euclidean transformation t which is not an affine transformation 
(c) A transformation t which is both Euclidean and affine 


(d) A transformation t which is one-one, but is neither Euclidean nor affine 


3. The affine transformations ft and f2 are given by 


t(x) = (; “1 ) + (_:) 
=i 2 = 
wo=(h JC). 


Determine the following composites. 
(a) trots (b) f2 o ty (c) ty of} 
4. Determine the inverse of each of the following affine transformations. 


2- 2 2 1 
(a) ni = (3 ~3)x+(4) w= (4 5)x+(3) 


5. Prove that the transformation 


t(x) = 3x (x € R’) 


and 


is an affine transformation, but not a parallel projection. 

6. Which of the following are affine properties? 
(a) distance (b) collinearity (c) circularity 
(d) magnitude of angle (e) midpoint of line segment 


Section 2.3 


1. The affine transformation t : R? — R? is given by 


oe 1 -1 ial. 2 
~A\2 -3 —4)° 
Determine the image under ¢ of each of the following lines. 


(a) y= —2x (b) 2y = 3x — 1 
2. The affine transformation t : R? — R? is given by 


4 1 
t(x) = (; t)x+(_i). 
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Determine the image under ¢ of each of the following lines. 
(a) 2x —S5y+3=0 (b)3x+y—-4=0 

3. Determine the affine transformation which maps the points (0, 0), (1, 0) and 
(0, 1) to the points: 
(a) (0, —1), C, 1) and (—1, 1), respectively; 
(b) (—4, —5), C1, 7) and (2, —9), respectively. 

4. Determine the affine transformation which maps the points (1, 1), (3, 2) and 
(4, 1) to the points (0, 1), (1, 2) and (3, 7), respectively. 

5. Determine the affine transformation which maps the points (1, —1), (5, —4) 
and (—2, 1) to the points (1, 1), (4, 0) and (0, 2), respectively. 

6. Prove that the affine transformation t for which 


t(x) = Se ne 


maps each point of the line y = x in R? onto itself. 
7. Determine the matrices A and b for the affine transformation 


t(x) = Ax+b, 


where A and b are 2 x 2 and 2 x | matrices, respectively, given that t maps 
each point of the line y = 0 onto itself and (0,1) onto (2,3). Prove also that 
t is a parallel projection of R? onto itself. 


Section 2.4 


1. The points P, Q, R and S lie on a line, in that order; the distances between 
them are 4 units, 2 units and 3 units, respectively. Determine the ratios 
PR: RS and PS: SQ. 

2. A point X lies inside a triangle AABC, and the lines AX, BX and CX meet 
the opposite sides of the triangle at P, Q and R, respectively. The ratios 
AR : AB and BP : BC are | : 5 and 3 : 7, respectively. Determine the ratio 
AC: AQ. 

3. Let £ be a line that crosses the sides BC, CA and AB of a triangle AABC 
at three distinct points P Q and R, respectively. The ratios BC : CP and 
CQ: QA are 3: 2 and | : 3, respectively. Determine the ratio AR : RB. 

4. ABCD is a parallelogram, and the point P divides AB in the ratio 2 : 1; the 
lines AC and DP meet at Q, and the lines BQ and AD meet at R. 

(a) Determine the images of P, Q and R under the affine transformation t¢ 
which maps A, D and C to (0, 1), (0, 0) and (1, 0), respectively. 

(b) By considering the image of ABCD under ¢, determine the ratios 
BQ: QRand AR: RD. 

5. The triangle AABC has vertices A(—1,2), B(—3,—1) and C(3, 1), and 
the points P (1,5), O (1,3) and R(—3,1) lie on BC, CA and AB, 
respectively. 

(a) Determine the ratios in which P, Q and R divide the sides of the 
triangle. 
(b) Determine whether or not the lines AP. BQ and CR are concurrent. 
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. The triangle AABC has vertices A(2,0), B(—3,0) and C(3,—3), and 


the points P(—1,—1), Q(,3) and R (-4. 0) lie on BC, CA and AB, 
respectively. 

(a) Determine the ratios in which P Q and R divide the sides of the triangle. 
(b) Determine whether or not the points P. Q and R are collinear. 


. AABCis atriangle, and X a point which does not lie on any of its (extended) 


sides. Also, AX meets BC at P. BX meets CA at Q and CX meets BA at R. 


Prove that 
AX _ AR . AQ 
XP RB QC 
(This result is often known as van Aubel’s Theorem.) 


. AABCis atriangle, and X a point which does not lie on any of its (extended) 


sides. Next, AX meets BC at P BX meets CA at Q and CX meets BA at R. 
Also, RO meets BC at L, PR meets CA at M and PQ meets BA at N. Prove 
that L, M and WN are collinear. 

Hint: Apply the result of Problem 5 in Subsection 2.4.3 to AABC and points 


. BL CM AN 
L, M and N in turn. Then evaluate the product TC MA” NB: 


. Three disjoint circles of unequal radii lie in the plane, their centres being 


non-collinear. Pairs of tangents are drawn to each pair of circles such that 
the point of intersection of the two tangents to each pair of circles lies 
beyond the two circles. Prove that the three intersection points are collinear. 


Section 2.5 


il. 


An ellipse touches the sides AB, BC, CD, DA of a parallelogram ABCD at 
the points P, Q, R, S, respectively. Prove that the lengths CQ, QB, BP and 
CR satisfy the equation 

CQ CR 

OB BP 
Determine the equation of the image of the parabola P with equation y = 
x* under the affine transformation t : R* +> R? given by 


1 
t(x) = © ) x. 


Show that the image of the vertex of P is not the vertex of t(P). 

Prove that for any triangle AABC there exists an ellipse that touches the 
sides AB, BC and CA at their midpoints. 

Let P(acos6,bsin@), where @ is not a multiple of 2/2, be a point on the 


ellipse C : _ + 
corresponding point on the ‘auxiliary circle’ C’: x* + y? = a’. Prove that 
the tangents at P to C and at P’ to C’ meet on the x-axis. 

Hint: Write down an affine transformation that maps C to C’ and P to P’, 
and that maps each point of the x-axis to itself. 

Given any two points P and P’ on ellipses E and E’, respectively, show 
that there exists an affine transformation that maps E to E’ and P to P’. 
Find the endpoints of the chord AB of the hyperbola H with equation xy = 1 


that is bisected by the point P(2, 1). 


= 1, where a > b > O; and P’(acos@,asin@) the 
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Direct calculation using 
coordinates is also 
possible, but very tedious! 


This proves that the 
property of ‘being a vertex 
of a parabola’ is not an 
affine property. 


This result is analogous to 
that for hyperbolas in the 
Corollary in 

Subsection 2.5.2. 


Summary of Chapter 2 


2 . . 
7. E is the ellipse with equation = + a = 1, and P (=. zz) is a point 
inside E. AB is a chord of E through P, and O is the centre of E. Find the 
maximum value of 4 as A varies on E. 


Summary of Chapter 2 


Section 2.1: Geometry and transformations 


1. An isometry of R? is a function which maps R* onto R? and preserves 
distances. 
Every isometry of R? has one of the following forms: 
e a translation along a line in R?; 

a reflection in a line in R?; 

a rotation about a point in R?; 

a composite of translations, reflections and rotations in R*. 


The set of all isometries of R* forms a group under composition of 

functions; in particular, the composite of two isometries is an isometry. 

2. Euclidean geometry is the study of those properties of figures that are 
unchanged by the group of isometries. 
These properties are called Euclidean properties, and include distance, 
angle, collinearity of points and concurrence of lines. 

3. The Kleinian view of geometry is the idea that geometry can be thought 
of in terms of a group of transformations acting on a space. 

4. The transformations of R? given by 


x cos@ —sind ‘x e d 
(Jo (S aro) (3) + (4 Jam 
x cos @ sind x e 
(yo (Ss al )+(4) 


are isometries; they represent, respectively, anticlockwise rotation about 
the origin through an angle 6 followed by a translation by (e, f), and 
reflection in a line through the origin that makes an angle 6/2 with the 
x-axis followed by a translation by (e, f). 
Every isometry of R? is of one or other of these two forms 

5. A Euclidean transformation of R? is a function t : R? > R? of the form 
t(x) = Ux +a, where U is an orthogonal 2 x 2 matrix and a € R*. The 
set of all Euclidean transformations of R? is denoted by E(2). 

6. Every isometry of R? is a Euclidean transformation of R?, and vice versa. 

7. The set of Euclidean transformations of R* forms a group under the 
operation of composition of functions. 

8. The inverse of the Euclidean transformation r(x) = Ux + a is given by 
t (x) =U7'x—U" a. 
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9. Euclidean geometry is the study of those properties of figures that are 
preserved by Euclidean transformations of R?. 

10 A figure F; is Euclidean-congruent to a figure F> if there is a Euclidean 
transformation which maps F onto F2. Loosely speaking, two figures are 
congruent if they have the same size and shape. 

Euclidean congruence is an equivalence relation. 

A figure F; is G-congruent to a figure Fy in some geometry defined 
by a group G of transformations acting on the space of the geometry if 
there is a transformation in G which maps F; onto F2. G-congruence is 
an equivalence relation. 


Section 2.2: Affine Transformations and Parallel Projections 


1. An affine transformation of R? is a function t : R? — R? of the form 
t(x) = Ax +b, where A is an invertible 2 x 2 matrix and b € R?. The set 
of all affine transformations of R? is denoted by A(2). 

Every Euclidean transformation of R? is an affine transformation. 

2. The set of affine transformations A(2) forms a group under the operation 
of composition of functions. 

The inverse of the affine transformation t(x) = Ax + b is t~!(x) = 
Av!x—Aq'b. 

3. Affine geometry is the study of those properties (called affine properties) 
of figures in the plane R? that are preserved by affine transformations. 
Basic properties of affine transformations 
Affine transformations: 

1. map straight lines to straight lines; 
2. map parallel straight lines to parallel straight lines; 
3. preserve ratios of lengths along a given straight line. 

4. A parallel projection is a one-one mapping of R? to itself defined in the 
following way. Let z, and 72 be planes in R?, with parallel rays of light 
shining through them; then the function p which maps each point P in 
zt, to the corresponding point P’ in z2 is a parallel projection from x 
onto 72. 

If wz, and z2 are parallel, then the parallel projection from zr; onto 2 is 
an isometry. 

5. Basic properties of parallel projections 
Parallel projections: 

1. map straight lines to straight lines; 
2. map parallel straight lines to parallel straight lines; 
3. preserve ratios of lengths along a given straight line. 

6. A diameter of an ellipse is a chord of the ellipse that passes through its 
centre. 

Midpoint Theorem Let £ be achord of an ellipse. Then the midpoints 
of the chords parallel to @ lie on a diameter of the ellipse. 

7. Conjugate Diameters Theorem Let ¢ be a diameter of an ellipse. Then 
there is another diameter m of the ellipse such that 
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Summary of Chapter 2 


12. 


(a) the midpoints of all chords parallel to @ lie on m; 

(b) the midpoints of all chords parallel to m lie on ¢. 

The directions of these two diameters are called conjugate directions, and 
the diameters are called conjugate diameters. 


. Given any ellipse, there is a parallel projection which maps the ellipse onto 


a circle. 


. Under parallel projection certain properties of figures, such as length 


and angle, are not necessarily preserved. This is one difference between 
Euclidean geometry and affine geometry. 


. Each parallel projection is an affine transformation. 


However, an affine transformation is not necessarily a parallel projec- 
tion. For example, the doubling map of R? to itself given by t(v) = 2v is 
an affine transformation but cannot be modelled by a parallel projection. 


. An affine transformation ¢ is completely determined by its effect on the 


three non-collinear points (0,0), (1,0) and (0,1). 

Hence if we know the points onto which (0,0), (1,0) and (0,1) are 
mapped by ¢, we can determine A and b in the formula t(x) = Ax + b, 
x € R?. 

An affine transformation can be expressed as the composite of two parallel 
projections. 


Section 2.3: Properties of Affine Transformations 


1. 


Strategy To determine the image of a line or conic in R? under an affine 
transformation t : R* — R? given by t(x) = Ax-+ b, let x and coordi- 
nates (x, y) denote points in the domain copy of R?, and x’ and coordinates 
(x’, y’) denote points in the codomain copy. Then: 

1. express the relationship between x and x’ in the form x = A!x’—A7!p; 
2. determine formulas for x and y in terms of x’ and y’; 

3. substitute for x and y in the equation of the line or conic; 

4. drop the dashes from x’ and y’. 

The resulting equation describes the image under ¢. 


. Strategy To determine the unique affine transformation f(x) = Ax +b 


which maps (0,0), (1,0) and (0,1) to the three non-collinear points p, q and 

r, respectively: 

1. take b = p; 

2. take A to be the matrix with columns given by q — p andr — p. 
Warning This Strategy requires that p, q and r are non-collinear. If 

they are collinear, the matrix A described in the Strategy is non-invertible, 

and hence the transformation given by the procedure in the Strategy is not 

an affine transformation. 


. Fundamental Theorem of Affine Geometry Let p, q, r and p’, q’, r’ be 


two sets of three non-collinear points in R?. Then: 

(a) there is an affine transformation t which maps p, q and r to p’, q’ and 
r’, respectively; 

(b) the affine transformation f is unique. 
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5. 


Strategy To determine the affine transformation t which maps three non- 

collinear points p, q and r to another three non-collinear points p’, q’ and 

r’, respectively: 

1. determine the affine transformation t; which maps (0,0), (1,0) and (0,1) 
to the points p, q and r, respectively; 

2. determine the affine transformation f2 which maps (0,0), (1,0) and (0,1) 
to the points p’, q’ and r’, respectively; 

3. calculate the composite t = fy of, : 


. Two figures are affine-congruent if there is an affine transformation which 


maps one onto the other. 

All triangles are affine-congruent. 
An affine transformation preserves ratios of lengths along parallel straight 
lines. 


Section 2.4: Using the Fundamental Theorem of Affine 
Geometry 


1. 
2. 


Median Theorem The medians of any triangle are concurrent. 
If a point R divides a line segment PQ in the ratio (1 — A) : A, then 
RO = a . The magnitude of the ratio equals the length of PR divided 


by the length of RQ, and the ratio is positive if 0 < X < 1 (when PR ane 
RO he in the same direction) and negative if 1 < 0 or A > 1 (when PR 


and RO lie in opposite directions). 


If P, Q and R have coordinates (xp, yp), (xg,yqQ) and (xp, yr), 
respectively, then es = z = and RO = = oat (If the denomina- 
tor of one of these fractions vanishes, then use the other to determine the 


ratio.) 


. Ceva’s Theorem Let AABC be a triangle, and let X be a point which 


does not lie on any of its (extended) sides. If AX meets BC at P, BX meets 
CA at Q and CX meets BA at R, then 


AR BP CQ _ 


. Converse to Ceva’s Theorem Let P Q and R be points other than ver- 


tices on the (possibly extended) sides BC, CA and AB of a triangle AABC, 


such that aK. ae on = |. Then the lines AP, BQ and CR are concurrent. 


. Menelaus’s Theorem Let AABC be a triangle, and let ¢ be a line that 


crosses the sides BC, CA and AB at three distinct points P Q and R, 
respectively. Then RR - PC’ 0A = 


. Converse to Menelaus’ Theorem Let P Q and R be points other than 


vertices on the (possibly extended) sides BC, CA and AB of a triangle 
AABC, such that AR BP CO _ = —1. Then the points P Q and R are 
collinear. 
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fe 


Many results about properties of triangles (such as collinearity, lines being 
parallel, and ratios of lengths along a given line) are preserved under affine 
transformations, are proved following a standard pattern. 

First, we choose a particular type of triangle for which it is easy to prove 
the result. Then, by asserting the existence of an affine transformation 
from that triangle to an arbitrary triangle, we may deduce that the result 
holds for all triangles. 


. Let A = (aj, a2), B = (bj, b2) and C = (c},c2) be three non-collinear 


points in the plane R*; we call AABC the triangle of reference. Then a 
point (x, y) in the plane has barycentric coordinates (&, 7, ¢) with respect 
to AABC if 


x =&a,+nbi+ Cc, 
y = €a. + nb2 + fc2, and 


l= g+n+¢. 
ay by cy J 
If we set M = | a2 bz cz |}, then M is invertible. Also | y | = 
1 1 1 1 
g 3 x 
M]{ 7] and] 7] = M7! y 
¢ c 1 
. The points P. Q and R with barycentric coodinates (1, 71, €1), (§2, 72, €2) 
&i & & 
and (&3, 3, ¢3) are collinear if and only if | 4; 2 73 | =0. 
G1 f2 
The line @ through the points with barycentric coordinates (&1, 1, 1) 
&i & & 
and (&2,72,¢2) has equation] 71 2 7 | =0. 
f& 2 


. Section Formula The point R that divides the line @ joining the points 


P and Q with barycentric coordinates (&1, 71, ¢,) and (&, 2, 2) in the 
ratio (1 — 4) : A has barycentric coordinates 


(§,7,6) =i, m1, 61) + (1 — A) (Eo, 02, 62). 


Section 2.5: Affine Transformations and Conics 


1. 


2. 


Every ellipse is affine-congruent to the unit circle with equation 
ve+yr=l. 

Every hyperbola is affine-congruent to the rectangular hyperbola with 
equation xy = |. 

Every parabola is affine-congruent to the parabola with equation y? = x. 
Affine transformations map ellipses to ellipses, parabolas to parabolas, and 
hyperbolas to hyperbolas. 

In Affine Geometry: 
(a) all ellipses are congruent to each other; 
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(b) all hyperbolas are congruent to each other; 

(c) all parabolas are congruent to each other. 

Non-degenerate conics are congruent only to non-degenerate conics of the 
same type. 

3. Let ¢ be an affine transformation, and let C be an ellipse or hyperbola with 
centre R. Then ¢(C) has centre t(R). 

4. Let ¢ be an affine transformation, and let H be a hyperbola with asymptotes 
£; and €2. Then t(#) has asymptotes ¢(€1) and t(€2). 

5. Let ¢ be an affine transformation, and let @ be a tangent to a conic C. Then 
t(£) is a tangent to the conic ¢(C). 

6. Affine geometry can be used to tackle problems involving conics when the 
property being investigated is an affine property. We first use an affine trans- 
formation to map the original conic onto one of our standard conics, tackle 
the problem in hand there, and then map back to the original conic. 

7. Given any hyperbola and a point P on it, there is an affine transformation 
which maps the hyperbola onto the rectangular hyperbola xy = 1, and the 
point P to (1,1). 


3 Projective Geometry: Lines 


Geometry is one branch of mathematics that has an obvious relevance to the 
‘real world’. Earlier, we studied some results in Euclidean geometry and we 
described the group of Euclidean transformations, the isometries. We saw 
that the Euclidean transformations preserve distances and angles, and have 
a definite physical significance. 

In this chapter we study projective geometry, a very different type of geome- 
try, that has important but less obvious applications. It was discovered through 
artists’ attempts over many centuries to paint realistic-looking pictures of 
scenes composed of objects situated at differing distances from the eye. How 
can three-dimensional scenes be represented on a two-dimensional canvas? 
Projective geometry explains how an eye perceives ‘the real world’, and so 
explains how artists can achieve realism in their work. 

In Section 3.1, we look at the development of perspective in Art and explain 
the concept of a perspectivity. We describe Desargues’ Theorem, which con- 
cerns a curious property of two triangles whose vertices are in perspective 
from a single point, and so explain that perspective can play a key role in the 
statement and the proof of theorems in mathematics. 

In Section 3.2, we define the term projective point (or Point) and call the 
set of all such Points the projective plane, which we denote by RP*. We also 
define a projective line (or Line). To enable us to tackle problems in projective 
geometry algebraically, we introduce homogeneous coordinates to specify the 
Points in RP’. 

In Section 3.3, we define the projective transformations of RP? and use 
them to define projective geometry. We also prove the Fundamental Theorem 
of Projective Geometry, which states that given two sets of four Points there is 
a unique projective transformation of RP* that maps the Points in one set to 
the corresponding Points in the other set. This crucial result enables us to apply 
a preliminary transformation to many geometric problems, thereby simplify- 
ing their solution by reducing the arithmetic involved. It turns out that there 
is a close connection between the idea of perspective in R? and projective 
transformations. 

In Section 3.4, we use the Fundamental Theorem of Projective Geometry 
to prove several results, including Desargues’ Theorem. We also introduce the 


Chapters | and 2. 


For example, in Computer 


Graphics and in Art. 


eye 


We also require that no 
three Points in either set 


lie on a Line. 
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concept of duality, which involves a remarkable relationship between Points 
and Lines. 

Finally, in Section 3.5, we note that the ideas of distance and ratio of dis- 
tances along a line have no immediate analogues in RP’; nevertheless, we are 
able to define a related quantity called the cross-ratio of four collinear Points 
in RP’. This quantity is very useful in proving various mathematical results, 
and it has ‘real life’ applications — such as in aerial photography. 


3.1. Perspective 


3.1.1 Perspective in Art 


The first ‘pictures’ were probably Cave Art wall paintings: for example, depic- 
tions of animals and hunters. Up to the Middle Ages, most pictures were drawn 
on walls, floors or ceilings of buildings and were intended to convey messages 
rather than to be accurate illustrations of what an eye might see. For example, 
Christian religious art portrayed Christ and the Saints, the Bayeux tapestry 
outlined events such as the Norman Conquest and the Battle of Hastings, and 
so on. 


(Clockwise from left) 
Hunters below antelopes. 
Bambata cave, Zimbabwe 
© M. Jelliffe; Tomb of 
Rekhmare, Thebes. 1500 
BC © Ronald Sheridan; 
Bayeux Tapestry: The 
death of Harold. 

These prints are 
reproduced by kind 
permission of A.A. & A. 
Ancient Art and 
Architecture Collection. 


To the modern eye, the people and animals in these pictures appear to be 
rather stylized, and the whole scene seems very two-dimensional. The events 
illustrated do not appear to be properly integrated into the background, even if 
this is included. 

Towards the end of the 13th century, early Renaissance artists began to 
attempt to portray ‘real’ situations in a realistic way. For example, people at the 
back of a group would be drawn higher up than those at the front — a technique 
known as terraced perspective. 


Perspective 


As artists struggled to find better techniques to improve the realism of their 
work, the idea of vertical perspective was developed by the Italian school of 
artists (including Duccio (1255-1318) and Giotto (1266—1337)). To create an 
impression of depth in a scene, the artist would represent pairs of parallel lines 
that are symmetrically placed either side of the scene by lines that meet on 
the centre line of the picture. The method is not totally realistic, since objects 
do not appear to recede into the distance in the way that might be expected. 
The problem of depicting ‘distant objects looking smaller’, with a properly 
integrated foreground and background, was tackled by many artists, including 
notably Ambrogio Lorenzetti (c. 1290-1348). 


The modern system of focused perspective was discovered around 1425 by 
the sculptor and architect Brunelleschi (1377-1446), developed by the painter 
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Simone Martini ‘Maesta’ 
Palazzo Pubblico, Sala del 
Mappamods, Siena (su 
concessione del commune 
di Siena). Foto LENSINI 
Siena. 


Giotto is sometimes called 
the ‘Father of Modern 
Painting’. 


‘Last Supper’ painted by 
Duccio; Opera del 
Duomo, Siena. Foto 
LENSINI Siena. 
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and architect Leone Battista Alberti (1404-1472), and finally perfected by 
Leonardo da Vinci (1452-1519). 

These artists realized that what the eye actually ‘sees’ of a scene are the vari- 
ous rays of light travelling from each point in the scene to the eye. An effective 
way of deciding how to depict a three-dimensional scene on a two-dimensional 
canvas so as to create a realistic impression is therefore as follows. Imag- 
ine a glass screen placed between the eye and the three-dimensional scene. 
Each line joining the eye to a point of the scene pierces the glass screen at 
some point. The set of all such points forms an image on the screen known 
as a cross-section. Since the eye cannot distinguish between light rays coming 
from the points of the actual scene and light rays coming from the correspond- 
ing points of the cross-section (since these are in exactly the same direction), 
the cross-section produces the same impression as the original scene. In other 
words, the cross-section gives a realistic two-dimensional representation of the 
three-dimensional scene. 


_w—_ie 


object screen 


The German artist Albrecht Diirer (1471-1528) introduced the term per- 
spective (from the Latin verb meaning ‘to see through’) to describe this 
technique, and illustrated it by a series of well-known woodcuts in his book 
Underweysung der Messung mit dem Zyrkel und Rychtsscheyed (1525). The 
Diirer woodcut below shows an artist peering through a grid on a glass screen 
to study perspective and the effect of foreshortening. 


Alberti wrote that the first 
necessity for a painter is 
‘to know geometry’. 


In English: /nstruction on 
measuring with compass 
and straight edge. 

We discuss foreshortening 
in Subsection 3.1.2. 


By permission of The 
British Library. © The 
British Library Board 
C.119.h.7(1). 


Perspective 


Of course, the picture displayed on the screen is just one representation of 
the scene. If the screen is placed closer to, or further away from, the eye, the 
size of the cross-section changes. Also, the screen may be placed at a different 
angle for a given position of the eye, or the eye itself may be moved to a 
different position. In each case, a different cross-section is obtained, though 
they are all related to each other. 


3.1.2 Mathematical Perspective 


To help us understand the relationship between different representations of a 
scene, we now look at perspective from a mathematical point of view. In place 
of an eye and light rays travelling to it, we use the family of all lines in R* 
through a given point. For convenience, this point will often be the origin O. 
The glass screen is replaced by a plane in R? that does not pass through the 
origin. 

In order to compare the cross-sections that appear on different screens, we 
consider two planes z and zr’ that do not pass through O. A point P in z and 
a point Q in z’ are said to be in perspective from O if there is a straight line 
through O, P and Q. A perspectivity from x to x’ centred at O is a function 
that maps a point P of z toa point Q of z’ whenever P and Q are in perspec- 
tive from O. Notice that the planes z and zr’ may lie on the same side of O as 
shown on the left below, or they may lie on opposite sides of O as shown on 
the right. 
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In terms of O representing 
an eye, the figure on the 
right corresponds to the 
observer having the ability 
to look simultaneously 
both forwards and 
backwards! 
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One complication with the above definition of a perspectivity is that the 
domain of the perspectivity is not necessarily the whole of z. Indeed, if P is 
any point of z such that OP is parallel to 2’, as shown in the margin, then P 
cannot have an image in z’, and cannot therefore belong to the domain of the 
perspectivity. From a mathematical point of view, this need to exclude such 
exceptional points from the domain of a perspectivity turns out to be rather a 
nuisance. In Subsection 3.2.3 we shall therefore reformulate the definition of 
a perspectivity in such a way that these exceptional points can be included in 
the domain. 


Even with only the preliminary definition of perspectivity given above, it is 
clear that some features of figures are preserved under a perspectivity, while 
others are not. For example, the figure on the left below illustrates a particular 
perspectivity in which a line segment in one plane maps onto a line segment 
in another plane. This suggests that collinearity is preserved by a perspectivity. 
On the other hand, the figure on the right illustrates a perspectivity in which a 
circle in one plane appears to map to a parabolic shape in another plane, which 
suggests that ‘circularity’ is not preserved. 


One of our main tasks is to study the images of standard configurations such 
as lines and conics under perspectivities. This chapter deals with lines; the next 
chapter deals with conics. 

Consider a perspectivity with centre O that maps points in a plane z to 
points in a plane zr’. A convenient way to visualize the image of a line @ under 
the perspectivity is to consider an arbitrary point P on €. As P moves along @, 
the line OP sweeps out a plane. The line ¢’ where this plane intersects zr’ is the 
image of ¢. 


Perspective 


To be specific, consider the perspectivity p with centre O that maps points 
in a horizontal plane z to points in a vertical plane z’, and let L be the line 
where z and z’ intersect. Under p, every line @ in z that is parallel to L maps 
to a horizontal line ¢’ in zr’. In particular, L maps to itself. The only exception 
is the line / that passes through the foot of the perpendicular from O to zr. 
This line does not have an image in z’ since the lines joining points of h to O 
are parallel to z’. 


Next, consider the image under the same perspectivity p of a line @ in z that 
is perpendicular to L. To do this, let P denote the foot of the perpendicular 
from O to the plane z’. Although P is not the image of any point of zr, the 
plane through O and ¢ meets z’ in some line ¢’ that passes through P. It 
follows that the image of @ under p is some line ¢’ through P, with the point 
P itself omitted. 


The above argument works for any line in z that is perpendicular to L. All 
such lines are mapped by the perspectivity p to lines in x’ that pass through 
P, and that omit the point P itself. 

We may combine our observations concerning lines in z that are parallel 
to L or perpendicular to L in the following way. Let ABCD be a rectangle in 
z on the opposite side of L from O, with sides AB and CD that lie on lines 
£, and 2, perpendicular to L. Then AD and BC both map onto horizontal 
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lines in x’ between L and P. As the side BC recedes from L, its image B’C’ 
under the perspectivity p moves further up zr’ towards P, becoming shorter as 
it moves. 


To an observer whose eye is located at O, the lines 2; and ¢2 appear to meet 
‘at infinity’, and this corresponds to their images under p appearing to meet at 
P. The point P is called the principal vanishing point of the perspectivity p 
because the images in z’ of all lines in z perpendicular to L appear to vanish 
there. 

In fact, a perspectivity has many vanishing points. For instance, let £ be any 
line in z that intersects L at an angle of 7/4. Now let h’ be the horizontal line 
in z’ through P, and let D be the point on h’ such that OD is parallel to @. 
Then the plane through O and £ meets z’ in some line ¢’ that passes through 
D. It follows that the image of ¢ under p is a line through D, with the point D 
itself omitted. 


The point D is called a diagonal vanishing point of the perspectivity. All 
lines in the plane z that are parallel to the given line @ have images in zr’ that 
are lines through D, with the point D itself omitted. 

In the same way, each point of the horizontal line h’ in z’ through P is a van- 
ishing point for the images of all lines in z in some direction; hence the line h’ 
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Artists describe this 
shortening of the image on 
x’ of lines of equal length 
in z as foreshortening. 


You can think of €; and £9 
as a pair of railroad lines 
disappearing into the 
distance. 


Here the symbol zr is 
being used in two 
different ways: as a label 
for the embedding plane, 
and as an angle. 


That is, the images appear 
to vanish at D. 


Perspective 


is called the vanishing line. It corresponds to the ‘horizon’ in the plane — in 
other words, to the points ‘at infinity’ towards which an observer’s eye is 
pointing when looking in a horizontal direction. 


3.1.3 Desargues' Theorem 


The idea that information in three dimensions can be related to information in 
two dimensions, and vice versa, plays an important role in mathematics just 
as it does in Art. For example, consider the following three-dimensional figure 
that consists of two triangles AABC and AA’B’C’ which are in perspective 
from a point U. For the moment we shall assume that no pair of corresponding 
sides BC and B’C’, CA and C’A’, and AB and A’B’, are parallel. 


We shall show that this three-dimensional figure has the property that BC 
and B’C’, CA and C’A’, AB and A’B’ meet at P, Q, R, respectively, where 
P,Q and R are collinear. This will enable us to formulate an equivalent two- 
dimensional result, known as Desargues’ Theorem. 

To prove the three-dimensional result, observe that both BC and B’C’ lie in 
the plane that passes through the points U, B and C. Since BC and B’C’ are 
coplanar but not parallel, they must meet at some point P. 


Similarly, the sides CA and C’A’ meet at some point Q, and the sides AB and 
A’B’ meet at some point R. 

Since the points P, Q and R lie both on the plane which contains the trian- 
gle AABC and on the plane which contains the triangle AA’B’C’, they must 
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Girard Desargues 
(1593-1662) was a French 
engineer and architect. 
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lie on the line @ where the two planes meet. It follows that P,Q and R are 
collinear. 


To obtain the equivalent two-dimensional result, imagine that you are view- 
ing the three-dimensional configuration through a transparent screen. Since 
this viewing process will not alter the collinearity of points or the coincidence 
of lines, we may reinterpret the three-dimensional result in terms of the image 


on the screen to obtain the following theorem. We give a rigorous proof 
of Desargues’ Theorem in 
Theorem 1, 
Theorem 1  Desargues’ Theorem Subsection 3.4.1. 


Let AABC and AA’ B’C’ be triangles in R? such that the lines AA’, BB’ and 
CC’ meet at a point U. Let BC and B'C' meet at P, CA and C’A’ meet at Q, 
and AB and A’B’ meet at R. Then P, QO and R are collinear. 


Strictly speaking, we have not proved this theorem since it is not immedi- 
ately obvious that AABC and AA‘B'C’ can be obtained as images of triangles 
in R? which have corresponding sides that are not parallel. Nevertheless, the 
above argument does provide reasonably convincing evidence that the theorem 
is true. 

One remarkable feature of the above argument is the way in which the geom- 
etry of the figure on the transparent screen is characterized by the rays of light 
that enter an eye. Thus a point on the screen corresponds to a single ray of 
light that enters the eye, a line on the screen corresponds to a plane of rays 
of light that enter the eye, and so on. The geometry of the figure can be 
investigated entirely in terms of these rays of light. The screen is needed only 
to interpret the result in terms of a two-dimensional figure. 


The Projective Plane RP? 


In the rest of Chapters 3 and 4, we introduce a geometry known as pro- 
jective geometry that enables us to work with figures on a plane (a screen) 
as if they correspond to rays of light that enter an eye in the way described 
above. 


3.2 The Projective Plane RP? 


You have already met the Kleinian view that a geometry consists of a group of 
transformations acting on a space of points. In this section we begin our dis- 
cussion of projective geometry by investigating its space of points. The group 
of transformations is discussed in Section 3.3. 


3.2.1. Projective Points 


Imagine an eye situated at the origin of R? looking at a fixed screen. As we 
mentioned in Subsection 3.1.1, each point of the screen corresponds to the ray 
of light that enters the eye from the point. This correspondence between points 
of the screen and rays of light through the origin is the clue that we need to 
define a space of points for our new geometry. 


screen 


Point 


Ly 


eye 


Rather than use the points of the screen directly, we use the rays of light 
that enable an eye to ‘see’ the points from the origin. We can express this 
idea mathematically by defining a projective point to be a Euclidean line in 
R? that passes through the origin. In order to avoid confusion with Euclidean 
points of R*, we write Point with a capital P whenever we mean a projective 
point. 


Definitions A Point (or projective point) is a line in R? that passes 
through the origin of R>. The real projective plane RP? is the set of all 
such Points. 
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Introductary remarks to 
Chapter 2 


It is important that you 
use the capital letter P in 
‘Point’. 
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bay 


In order to prove results in projective geometry algebraically, we need to 
have an algebraic notation that can be used to specify the Points of RP*. To 
do this, we use the fact that a line £ through the origin O in R? is uniquely 
determined once we have specified a Euclidean point (other than O) that lies 
on ¢. For example, there is a unique line ¢ in R? through O and the point with 
Euclidean coordinates (4, 2, 6), so we can use these coordinates to specify a 
projective point. When doing this we write the coordinates in the form [4, 2, 
6], with square brackets to indicate that the coordinates refer to a projective 
point. 


Definition The expression [a, b, c], in which the numbers a, b, c are not 

all zero, represents the Point P in RP? which consists of the unique line 

in R? that passes through (0, 0, 0) and (a, b, c). We refer to [a, b, c] as Note that [0, 0, 0] is not 
homogeneous coordinates of P. If (a, b, c) has position vector v, then we _—_ defined. 

often denote P by [v] and we say that P can be represented by v. 


Remark 


Often we abuse our notation slightly, by talking about ‘the Point [a, b, c]’ 
when strictly speaking we should say ‘the Point with homogeneous coordinates 
[a, b, c]’. 


Notice that the homogeneous coordinates of a Point are not unique. For 
example, the Point with homogeneous coordinates [4, 2, 6] consists of a line 
that passes through (0, 0, 0) and (4, 2, 6). But this line also passes through # 
(—2, —1, —3), so [4, 2, 6] and [—2, —1, —3] both represent the same Point. 

In general, if (a, b, c) is any point on a line through the origin, and A is any 
real number, then (Aa, Ab, Ac) also lies on the line. Moreover, if (a, b,c) is not 
at the origin and 1 ¥ 0, then (Aa, Ab, Ac) is not at the origin either. It follows 
that [a, b, c] and [Aa, Ab, Ac] both represent the same Point, for any A 4 0. We 
express this by writing 


[a,b,c] = [Aa, Ab, Ac], for any A 0. (1) aN) 
Point (a, b, c) 
Conversely, if there is no non-zero real number A such that oO 


(a’,b’,c’) = (Aa, Ab, Ac), 


then (a,b,c) and (a’,b’,c’) cannot lie on the same line through the origin, 
and so the homogeneous coordinates [a, b, c] and [a’,b’,c’] must represent 
different Points in RP*. 


Example 1 Which of the following homogeneous coordinates represent the 
same Point in RP? as [6, 3, 2]? 


(a) [18,9,6]  (b) [12,-6,4] © [1.4.4] (d) [1, 2, 3] 


The Projective Plane RP? 


Solution 
(a) This represents the same Point as [6, 3, 2], for if A = 3, then 
[18,9, 6] = [6A, 3A, 2A] = [6, 3, 2]. 


(b) This represents a Point different from [6, 3, 2], for there is no A that 
satisfies the simultaneous equations 


12 = 6A, -6 = 31,4 = 2d. 
(c) This represents the same Point as [6, 3, 2], for if A = a then 
[1.5.5] = (64.34, 24] = 16,3, 21. 


(d) This represents a Point different from [6, 3, 2], for there is no 4 that 
satisfies the simultaneous equations 


1=6,, 2=3A, 3=2)., 


Problem 1 Which of the following homogeneous coordinates repre- 
sent the same Point in RP” as [1, 2, 3]? 


(a) [2,4,6] (b)[1,2,—3] (c)[—1,—2,-3] (d) [1], 12, 13] 


At first sight it may seem rather unsatisfactory that the coordinates of a Point 
are not unique. However, this ambiguity can often be turned to our advantage. 
For example, if a calculation yields a Point of RP? with fractional homo- 
geneous coordinates such as [}. 7 |; then the rest of the calculation may 
be simpler if we ‘clear’ the fractions and represent the Point by the integer 
homogeneous coordinates [6, 3, 2] instead. 


Problem 2 For each of the following homogeneous coordinates, find 
integer homogeneous coordinates which represent the same Point. 


31 _1 2 es ea 
(a) [3.3.-3] [043] @[%-4.-3] 
Given a collection of homogeneous coordinates, it is not always easy to spot 


those that represent the same Point. In such cases it is sometimes possible to 
rewrite the coordinates in a form that makes the comparison easier. 


Example 2 Determine homogeneous coordinates of the form [a, b, 1] for the 
Points 


[2, —1, 4], [4, 2, 8], [27, —1, 477], 


1 1 
[200, 100, 400], spt] F [6, —9, —12]. 


Hence decide which homogeneous coordinates represent the same Points. 
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Throughout the solution 
we use equation (1): 


[a,b,c] = [Aa, Ab, Ac], 


for any 1 4 0. 
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Solution According to equation (1), a Point of RP? is unchanged if its 
homogeneous coordinates are multiplied (or divided) by any non-zero real For, dividing by a 
number. Since the third coordinate of each Point is non-zero, we may divide non-zero number A is 


by this third coordinate to obtain homogeneous coordinates of the form divalent to multiplying 
by the non-zero number 


[a, b, 1] as follows: 1/a. 
(14) = [3-41]; [4,2, 8] = [2-41]: 
[27,-2,4n]=|$,-4.1]; (200, 100,400] = | 5, 4.1]; 
[-4.-4.-1] = E i. ii [6, —9, -12] = [-4: 3, 1]. 

Since [a, b,1] = [a’, b’, 1] if and only if a = a’ and b = V’, it follows that: 


[2,—1,4] and [27, —z, 47] represent the same Point; 


[4, 2, 8], [200, 100, 400] and [-2: = i _ 1] represent the same Point; 


[6, —9, —12] represents none of the other Points. 


Notice that the method used in Example 2 works only if the third coordinates 
of all the Points are non-zero. If this is not the case, then you may still be able 
to apply the technique using the first or second coordinates. 


Problem 3 Determine homogeneous coordinates of the form [1, b,c] 
for the Points 


[2,3,-5],  [—8,—12,20], | v2, v3,-v5], 
[4,—6,10],  [—-20,—30,50], 74, 148, 0}. 


Hence decide which homogeneous coordinates represent the same 
Points. 


Having defined projective points, we are now in a position to define a pro- 
jective figure. Just as a figure in Euclidean geometry is defined to be a subset 
of R?, so figures in projective geometry are defined to be subsets of RP’. 


Definition A projective figure is a subset of RP’. 


Projective figures are just sets of lines in R* that pass through the origin. 
Thus a double cone with a vertex at O, and a double square pyramid with 
a vertex at O, are both examples of projective figures, for they can both be 
formed from sets of lines that pass through the origin of R>. 


The Projective Plane RP? 


i | 


3.2.2 Projective Lines 


A particularly simple type of projective figure is a plane through the origin. 
Such a plane is a projective figure because it can be formed from the set of 
all Points (lines through the origin of R*) that lie on the plane. Since all but 
one of these Points can be thought of as rays of light that come from a line 
on a screen, it seems reasonable to define any plane through the origin to be a 
projective line. 


screen 
plane | plane 


Just as we use ‘Point’ to refer to a “projective point’, so we use ‘Line’ to 
refer to a ‘projective line’. The use of a capital L avoids any confusion with 


lines in R?. 


Definitions A Line (or projective line) in RP” is a plane in R? that passes 
through the origin. Points in RP? are collinear if they lie on a Line. 


Since a Line in RP? is simply a plane in R? that passes through the origin, 
it must consist of the set of Euclidean points (x, y, z) that satisfy an equation 
of the form 


ax +by+cz=0, 


where a, b and c are real and not all zero. We can interpret this fact in terms of 
RP? as follows. 
Theorem 1 The general equation of a Line in RP? is 
ax + by +cz =0, (2) 


where a, b, c are real and not all zero. 
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The exception is the ray of 
light parallel to the screen. 
We shall discuss the 
significance of this ray 
later, in Subsection 3.2.3. 
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Remark 


1. The equation of a Line is not unique, for, if A 4 0, then Aax+Aby+Acz = 0 
is also an equation for the Line. We can use this fact to “clear fractions’ from 
the coefficients just as we did for the homogeneous coordinates of a Point. 

2. From the figure in the margin it is clear that a Point lies on a Line, or 


a Line passes through a Point, if and only if the Point has homogeneous ee 
coordinates [x, y,z] which satisfy the equation of the Line. For example, 


[1,—-1, 1] lies on the Line 3x + y —2z =0, but [0, 1, 3] does not. C= 


In Euclidean geometry there is a unique line that passes through any two 
distinct points, as illustrated on the left of the figure below. Similarly, in pro- 
jective geometry two distinct Points (lines through the origin) lie on a unique 
Line (plane through the origin). 


Line 


line 


point 


point 


We express this observation in the form of a theorem, as follows. 


Theorem 2  Collinearity Property of RP? 
Any two distinct Points of RP? lie on a unique Line. 


It is sometimes possible to find an equation for the Line that passes through 
two distinct Points of RP* simply by spotting an equation of the form (2) that 
is satisfied by the homogeneous coordinates of both Points. 


Example 3 For each of the following pairs of Points, write down an equation 
for the Line that passes through them. 


(a) [3, 2, O] and [3, 4, 0] (b) [1, 2, 1] and [3, 0, 3] 
(c) [1, 0, 0] and [0, 0, 1] 


Solution 


(a) Both the Points have a z-coordinate equal to 0, so the homogeneous coor- 
dinates must satisfy the equation z = 0. This equation is of the form (2) The equation x = 3 is not 
with a = 0,b = O and c = 1, so it must be the required equation for the of the form (2), and so is 


ine not the equation of a Line. 


The Projective Plane RP? 


(b) The homogeneous coordinates of both Points satisfy x = z. This equation 
is of the form (2) with a = 1, b = 0 andc = —1. It must therefore be the 
required equation for the Line. 

(c) The homogeneous coordinates of both Points satisfy y = 0. This equation 
is of the form (2) with a = 0, b = | and c = 0, so it must be the required 
equation for the Line. 


Problem 4 For each of the following pairs of Points, write down an 
equation for the Line that passes through them. 


(a) [0, 1, O] and [0, 0, 1] (b) [2, 2, 3] and [3, 3, 7] 


But how do we find an equation for a Line through two given Points in cases 
where it cannot be found by inspection? As an example, consider the Points 
[2,—1,4] and [1,—1,1]. We could certainly substitute the values x = 2, 


y = -l,z =4andx = 1, y = —1, z = 1 into equation (2), to obtain 
the pair of simultaneous equations 
2a—b+4c=0, 
a-—b+c=0. 


Then subtracting twice the second equation from the first, we obtain 
b = —2c. So from the second equation it follows that a = —3c. If we set 
c = —1, say, then a = 3 and b = 2, so an equation for the Line is 


3x+2y—z=0. 


In this case the calculations are fairly straightforward, but there is an alter- 
native method that is often simpler. Notice that the Line in RP? through the 
Points [2,—1,4] and [1, —1, 1] is the Euclidean plane in R? that contains the 
position vectors of the points (2, —1,4) and (1, —1, 1) in R.A point (x, y, z) 
lies in this plane if and only if the vector (x, y, z) is a linear combination of 
the vectors (2, —1,4) and (1, —1, 1); in other words, if and only if the vectors 
(x, y, Zz), (2, —1,4) and (1, —1, 1) are linearly dependent. 

But three vectors in R? are linearly dependent if and only if the 3 x 3 deter- 
minant that has these vectors as its rows is zero. It follows that (x, y, z) lies 


in the plane containing the position vectors (2,—1,4) and (1,—1, 1) if and 
only if 


x y Zz 
2 -l 4,=0. 
1 -1 1 


Translating this statement back into a statement concerning RP, we deduce 
that the Point [x,y,z] lies on the Line through the Points [2,—1,4] and 
[1,—1, 1] if and only if 

x y 2Z 
2 -l 4/=0. 
1 -1 1 


143 


Of course, we could set c 
to have any non-zero 
value, but c = —1 keeps 
the calculation simple. 


Line 
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Expanding this determinant in terms of the entries in its first row, we obtain 


= “1 4 2 4 @: a4 
2 -1 4/=x - 
tots = 1 yt leet a 
= 3x+2y —z. 
Hence an equation for the required Line in RP? is 
3x +2y—z=0. (3) 


Remark 

It is always sensible to check your arithmetic by checking that the two given 
Points actually lie on the Line that you have found. For instance, the answer 
above is correct, since equation (3) is a homogeneous linear equation in x, y 
and z, and the equation is satisfied by x = 2, y = —1,z = 4 and byx = l, 
y=-l1,z=1. 


We may summarize the above method in the form of a strategy, as follows. 


Strategy To determine an equation for the Line in RP? through the Points 
[d,e, f] and [g,h, k]: 


1. write down the equation 


2. expand the determinant in terms of the entries in its first row to obtain 
the required equation in the form ax + by + cz = 0. 


Example 4 Find an equation for the Line that passes through the Points 
[1, 2, 3] and [2, —1, 4]. 


Solution An equation for the Line is 


x y Zz 
1 2 3}/=0 
2 -1 4 
Now 
x yy Zz 
2 A aa 1 ‘| 
bk 2) 3lh=x —y +Z 
> -1 4 —-1 4 2 4 2 -1 


11x + 2y — 5z. 


The Projective Plane RP? 145 


An equation for the Line is therefore You can easily check that 
the Points [1, 2, 3] and 
Ilx +2y —5z=0. [2, —1,4] lie on this Line. 


Problem 5 Determine an equation for each of the following Lines in 
RP*: 


(a) the Line through the Points [2, 5, 4] and [3, 1, 7]; 
(b) the Line through the Points [—2, —4, 5] and [3, —2, —4]. 


A similar technique can be used to check whether three given Points are Line 
collinear. Indeed, three Points [a,b,c], [d,e, f], [g,,k] are collinear if and 
only if the position vectors of the points (a, b,c), (d, e, f), (g,h, k) are linearly 
dependent; that is, if and only if 


Example 5 Determine whether the Points [2, 1, 3], [1, 2, 1] and [—1, 4, —3] 
are collinear. 


Solution We have 


2 1 4 
i, 2 1 =2| 22, ft 3-1 ‘| 
> a ag 


= 2(-6 — 4) — (-3+ 1) +3(4+ 2) 
= —20+2+ 18 
=U 


Since this is zero it follows that [2, 1, 3], [1, 2, 1] and [—1,4,—3] are 
collinear. 


We summarize the method of Example 5 in the following strategy. 


Strategy To determine whether three Points [a, b,c], [d,e, f], [g,h, k] are 
collinear: 


a b 
1. evaluate the determinant |d_ e 
g h 


2. the Points [a,b,c], [d,e, f], [g,4,k] are collinear if and only if this 
determinant is zero. 
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Problem 6 Determine whether the following sets of Points are 
collinear. 


(a) [1,2, 3], [1, 1, —2], [2, 1, —9] (b) [1, 2, —1], [2, 1, 0], [0, —1, 3] 


Before rushing to solve a problem using determinants, you should always 
stop to see if you can solve the problem more easily by inspection. For exam- 
ple, suppose that you are asked to check whether the Points [1, 0, 0], [0, 1, 0], 
[1, 1, 1] are collinear. Clearly, [1, 0, 0] and [0, 1, 0] lie on the Line z = 0, 
whereas [1, 1, 1] does not, so the Points are not collinear. 


Problem 7 Verify that no three of the Points [ 1, 0, 0], [0, 1, 0], 
[0, 0, 1 ] and [1, 1, 1] are collinear. 


The Points that you considered in Problem 7 play an important part in our 
development of the theory of projective geometry, so we give them special 
names. 


Definitions The Points [1, 0, 0], [0, 1, 0], [0, 0, 1] are known as the 
triangle of reference. The Point [1, 1, 1] is called the unit Point. 


Next, observe that any two distinct Lines necessarily meet at a unique Point. 
Indeed, a Line in RP? is simply a plane in R? that passes through the origin, 
and two distinct planes through the origin of R? must intersect in a unique 
Euclidean line through the origin; that is, in a Point. This is very different to 
the situation in Euclidean geometry where parallel lines do not meet. 


Theorem3 Incidence Property of RP” 
Any two distinct Lines in RP? intersect in a unique Point of RP. 


We can determine the Point of intersection of two Lines simply by solving 
the equations of the two Lines as a pair of simultaneous equations. 


Example 6 Determine the Point of intersection of the Lines in RP? with 
equations x + 6y —5z = Oandx —2y+z=0. 


Solution At the Point of intersection [x, y, z] of the two Lines, we have 
x+6y —5z=0, 
x—2y+z=0. 


Subtracting the second equation from the first, we obtain 
8y — 6z = 0, 


so that y = 3 z. Substituting this into the second equation, we obtain x = 5 Ze 
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Lines —+\ 


This result neatly 
complements Theorem 2, 
the Collinearity Property 
of RP?. 


We know that there is a 
unique Point of 
intersection, by 
Theorem 3. 
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It follows that the Point of intersection has homogeneous coordinates 


[22 zz, z| which we can rewrite in the form E i, 1 or [2, 3, 4]. 


Problem 8 Determine the Point of intersection of each of the follow- 
ing pairs of Lines in RP: 


(a) the Lines with equations x — y —z =Oandx+5y+2z=0; 
(b) the Lines with equations x + 2y — z = Oand 2x + y —4z = 0. 


Problem 9 Determine the Point of RP? at which the Line through 
the Points [1,2,-—3] and [2,—1,0] meets the Line through the Points 
[1,0, —1] and [1, 1, 1]. 


In some cases we can write down the Point at which two Lines intersect 
without having to solve any equations at all. For example, the Lines with 
equations x = 0 and y = 0 clearly meet at the Point [0, 0, 1]. 


Problem 10 Determine the Point of RP? at which the Line through 
the Points [1, 0, 0] and [0, 1, 0] meets the Line through the Points 
[0, 0, 1] and [1, 1, 1]. 


3.2.3 Embedding Planes 


So far we have used three-dimensional space to develop the theory of projec- 
tive geometry. In practice, however, we want to use projective geometry to 
study two-dimensional figures in a plane. In order to do this, we now inves- 
tigate a way of associating figures in a plane with figures in RP, and vice 
versa. 

Suppose that a plane z contains a figure F. We can place z into R*, making 
sure that it does not pass through the origin, and then construct a corresponding 
projective figure by drawing in all the Points of RP that pass through the 
points of F. For example, if F is the triangle shown on the left below, then 
the corresponding projective figure is a double triangular pyramid. Note that 
if we change the position of z in R*, we obtain a different projective figure 
corresponding to F. 


TT 
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Note that z 4 0, since 
[0, 0, 0] are not allowed as 
homogeneous coordinates. 


It is a double pyramid 
because the Points which 
make up the pyramid are 
lines that emerge from the 
origin in both directions. 
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Conversely, suppose that we start with a projective figure F. The corre- 
sponding Euclidean figure in z consists of the Euclidean points where the 
Points of F pierce 2. For example, if F is a double cone whose axis is at right 
angles to the embedding plane, as shown on the right above, then the corre- 
sponding Euclidean figure is a circle. Note that if we change the position of z 
in R3, we obtain a different plane figure corresponding to F. 

This correspondence between projective figures and Euclidean figures works 
well provided that each Point of the projective figure pierces the plane zr, as 
shown in the margin. Unfortunately, any Point of RP* that consists of a line 
through the origin parallel to x does not pierce z, and so cannot be associated 
with a point of z. Such a Point is called an ideal Point for z. 

All the ideal Points for z lie on a plane through O parallel to 2. This plane 
is a projective line known as the ideal Line for z. 


yy 4D 


O an ideal Point 
for 7 ideal 
Points 
the ideal for © 
Line for 7 


How can we represent a projective figure on z if the figure includes some 
of the ideal Points for 7? As a simple example, consider the Line illustrated 
in the margin. This is a projective figure which intersects z in a line ¢. Every 
Point of the Line pierces the embedding plane at a point of @ except for the 
ideal Point P which cannot be represented on z. In order to represent the Line 
completely, we need not only the line ¢ but also the ideal Point P. In other 
words, the Line is represented by € U { P}. 

In general, a projective figure can be represented by a figure in 2 provided 
that we are prepared to include a subset of Points taken from the ideal Line for 
zc. In order to allow for these additional ideal Points, we introduce the concept 
of an embedding plane. 


Definitions An embedding plane is a plane, 2, which does not pass 
through the origin, together with the set of all ideal Points for 7. The plane 
in R? with equation z = 1 is called the standard embedding plane. The 
mapping of RP? into the standard embedding plane is called the standard 
embedding of RP”. 


We may summarize the above discussion by saying that for a given embed- 
ding plane, every projective figure in RP* corresponds to a figure in the 


Point in RP 


corresponding 
Euclidean 


An ideal 
Line in Point P 
for m 


We frequently use z to 
denote both a plane and an 
embedding plane, but no 
confusion should arise. 
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embedding plane, and vice versa. The figure in the embedding plane may 
include some ideal Points but is otherwise a Euclidean figure. 

If two embedding planes are parallel to each other, the same Points of 
RP? correspond to ideal Points of the embeddings; whereas, if the embed- 
ding planes are not parallel, different Points of RP” correspond to ideal Points 
of the two embedding planes. 

Once we have represented a projective figure in an embedding plane, we can 
investigate the relationship between its Points and Lines without having to refer 
to three-dimensional space at all. For example, consider the representation 
of the triangle of reference and unit Point on the embedding plane x + y + 
z = 1, shown on the left below. If we extract the embedding plane from R?, as 
shown on the right, we can use the algebraic theory developed earlier to write 
down an equation for the Line through any two given Points, without reference 
to R3. 


[1,0,0] 


[0,0,1] 


Similarly, we can use the algebraic techniques to calculate the homogeneous 
coordinates of the Point of intersection of any two given Lines. 


Problem 11 On the right-hand diagram above, insert the homoge- 
neous coordinates of the Points where the Lines through [1, 1, 1] meet 
the sides of the triangle of reference. 


Any plane may be used as an embedding plane provided that it does not pass 
through the origin. For example, if we take zr to be the plane z = —1, then the 
ideal Line for z has equation z = 0, and the ideal Points are Points of the form 
[a, b,0], where a and b are not both zero. Any other Point [a, b, c] has c 4 0 and 
can therefore be represented in z by the Euclidean point(—a/c, —b/c, —1). 


Problem 12 Let z be the embedding plane y = —1. Describe the 
ideal Points for z, and specify the Euclidean point of 2 which represents 
the Point [2, 4, 6]. 


(-a/lc,—b/e,-1) 


ideal Line 
z=0 


embedding 
plane z=—1 
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Although we can choose any embedding plane to represent figures of RP, 
the representation does depend on the choice. For example, suppose that zr, is 
the embedding plane y = —1, and that 2 is the embedding plane z = —1. 
Now consider the projective figure which consists of two Lines €; and €2 with 
equations x = —z and x = z, respectively. These Lines intersect at the Point 
[0, 1, 0]. 


z=-l 


On the embedding plane zr; the Lines €; and €2 are represented by two lines __ This is the mathematical 
that can be seen to meet at the point corresponding to [0, 1, 0]. However, on fact which explains why 
the embedding plane zr the Point of intersection [0, 1, 0] is an ideal Point and artists sometimes draw 
so the Lines £; and £2 are represented by parallel lines that do not appear to eee pee a 
meet. The contrast between the two representations of ¢; and £2 is particularly intersecting lines. 
striking if we extract the two embedding planes from R? and lay them side by 
side, as follows. 


Ty 


[0,1,0] is 


ideal for 7, 


2a) Ly 


This example illustrates that Lines which appear to be parallel in one embed- 
ding plane may not appear to be parallel in another embedding plane. In the 
next section you will see that the transformations of projective geometry are 
chosen so as to ensure that the projective properties of a figure are unaffected 
by the choice of embedding plane. Since parallelism does depend on the choice 
of embedding plane, it cannot be a projective property, so the concept of 
parallel Lines is meaningless in projective geometry. 
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3.2.4 An equivalent definition of Projective Geometry 


In our work on projective geometry, we have used Euclidean points in a 
plane in R? to construct the projective points (Points) of the geometry RP’, 
homogeneous coordinates for those Points, and projective lines (Lines). 

Equivalently, we could have defined RP? as the set of ordered triples 
[a,b,c], where a, b, c are real and not all zero, with the convention that we 
regard [Aa, Ab, Ac] and [a,b,c] (where A 4 0) as the same Point in the geom- 
etry. We would then have defined projective lines (Lines) as the set of points 
[x, y,z] in RP? that satisfy an equation of the form ax + by + cz = 0, where 
a, b, c are real and not all zero, Then we would continue to develop the theory 
of projective geometry in the same way as we have done here. 

However, we chose to start our work by looking at a model of RP 
obtained by using an embedding plane z in R?> that does not pass through 
the origin. We modeled the projective points [a,b,c] by the Euclidean lines 
through the origin and the corresponding Euclidean points (a,b,c), plus 
“points at infinity’ (the ideal Points); and we modeled the projective lines 
by Euclidean planes through the origin, For convenience, we chose often to 
use Euclidean points (a,b,c) on a given embedding plane to describe the 
Euclidean model. 

The formal method of defining projective geometry, though, is less intuitive 
than the description motivated by the R* model! 


3.3. Projective Transformations 


3.3.1. The Group of Projective Transformations 


By now you should be familiar with the idea that a geometry consists of a space 
of points together with a group of transformations which act on that space. 

Having introduced the space of projective points RP? in Section 3.2, we 
are now in a position to describe the transformations of RP”. First we shall 
define the transformations algebraically, then we give a geometrical inter- 
pretation of the transformations using the ideas of perspectivity introduced 
in Section 3.1, and finally meet the Fundamental Theorem of Projective 
Geometry. 

Recall that a point of R* (other than the origin) on an embedding plane 
z (that does not pass through the origin) has coordinates x = (x, y,z) with 
respect to the standard basis of R*, and homogeneous coordinates of the 
corresponding Point [x] in RE? (which represents the points {Ax : A € R}) 
are [Ax,Ay,Az] for some real 4 # 0. Since the Points of RP? are just 
lines through the origin of R*, we need a group of transformations that 
map the lines through the origin of R° onto the lines through the origin 
of R?. Suitable transformations of R? that do this are the invertible linear 
transformations. 
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Any plane that does not 
pass through the origin in 
R? will serve as an 
embedding plane. 


Subsection 3.2.1 
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If A is the matrix of an invertible linear transformation of R? to itself, the 
transformation maps points x = (x, y,z) of R? to points Ax of R*; then the 


projective transformation with matrix A maps Points [x] of RP? to Points 

[Ax] of RP*. This suggests that we define the transformations of projective 

geometry as follows. In fact, any ‘continuous’ 
transformation of RP? to 


itself that maps Lines to 
Definitions A projective transformation of RP? is a function Lines and that preserves 


t: RP? > RP” of the form incidences of Lines 
corresponds to an 
invertible linear 
transformation of R?. We 
omit a proof of this fact. 


t : [x] > [Ax], 


where A is an invertible 3 x 3 matrix. We say that A is a matrix associated 
with . The set of all projective transformations of RP? is denoted by P(2). 


Example 1 Show that the function t : RP? > RP? defined by 


t: [x,y,z] [2x +z,-x + 2y — 3z,x — y+ 5z] 


is a projective transformation, and find the image of [1, 2, 3] under rf. 


Solution The transformation t has the form t : [x] +> [Ax], where x = 


(x, y, z) and 
0 1 
A=|-Il1 2 -3 
1 -l > 
Now 
2 0 1 
det A=J-1 2 -3 
1 -1 5 
= 2110 — 3) —-0+ (1 — 2) 
= 1340. 


So A is invertible. It follows that ¢ is a projective transformation. 
We have 


1.23) Des 14 0,1 = 94 15] = +6, 14), 


Problem 1 Decide which of the following functions t from RP? 
to itself are projective transformations. For those that are projective 
transformations, write down a matrix associated with f. 


(a) t: [x,y,z] [—2y + 3z, —x + 5y — z, —3x] 
Kb) fs eye) [le = Ty aye ey = 28 = Oy 72) 
(c) t: [x,y,z] [x —14+2z,2y —4z4+5,2x] 
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Problem 2 Let ¢ be the projective transformation associated with the 


matrix. 
1 1 -l 
A=[{-l1 -—2 1 
4 -3 4 


Determine the image under ¢ of each of the following Points. 


(a) [1,2,—-1] (b)[1,0,0] (©) [0, 1, 0] 
(d) [0, 0, 1] (e) [1, 1, 1] 


Since we can multiply the homogeneous coordinates of Points in RP? by 
any non-zero real number 4 without altering the Point itself, it follows that if 
A is a matrix associated with a particular projective transformation then so is 
the matrix AA, provided that A ~ 0. For example, another matrix associated 
with the transformation in Example | is 


= Oo 
B=|{ 2 -4 6], 
3 & =10 


for we have B = —2A. 


Problem 3 Write down a matrix with top left-hand entry 5 which is 
associated with the transformation in Example 1. 


Before we can use the projective transformations to define projective 
geometry, we must first check that they form a group. 


Theorem 1 The set of projective transformations P(2) forms a group 
under the operation of composition of functions. 


Proof We check that the four group axioms hold. 
G1 CLOSURE Let f; and fz be projective transformations defined by 
t) : [x] + [Ayx] and tf: [x] + [Ad2x], 
where A, and Az are invertible 3 x 3 matrices. Then 
ty 0 t2([x]) = t (t2([x])) 
= 1 ([A2x]) 
= [(A1A2)x]. 


Since A; and Ao are invertible, it follows that A;Az 
is invertible. So by definition t; 0 f2 is a projective 
transformation. 

G2 IDENTITY Let i : RP? + RP” be the transformation defined by 


i: [x] b> [Ix], 


where I is the 3 x 3 identity matrix; this is a projective 
transformation, since I is invertible. 
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Recall that a similar result 
holds for affine 
transformations. 
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Let t: RP? — RP? be an arbitrary projective transfor- 
mation, defined by ¢ :[x] > [Ax], for some invertible 3 
x 3 matrix A. Then for any [x] € RP”, 


t oi([x]) = [A(x)] = [Ax] 


and 
io t([x]) = [(Ax)] = [Ax]. 
Thus foi = iot = t¢. Hence 7 is the identity 
transformation. 
G3 INVERSES Let t : RP? + RP? be an arbitrary projective transfor- 


mation defined by 
t: [x] > [Ax], 


for some invertible 3 x 3 matrix A. Then we can define 
another projective transformation t’ : RP* + RP? by 


t's [x] > [A74x]. 
Now, for each [x] « RP?, we have 
tot'({x]) = t(A7'x}) = [A(A“')] = [x] 
and 
to t([x]) = t/([Ax]}) = [A '(Ax)] = [x1]. 


Thus ¢’ is an inverse for f. 
G4 ASSOCIATIVITY Composition of functions is always associative. 


It follows that the set of projective transformations P(2) forms a group. BH 


The above proof shows that if t; and tg are projective transformations with 
associated matrices A; and Ag, respectively, then tf; © tz is a projective trans- 
formation with an associated matrix A;A2. We therefore have the following 
strategy for composing projective transformations. 


Strategy To compose two projective transformations ¢; and f2: 


1. write down matrices A; and A» associated with ft; and fp; 
2. calculate A; Ap; 
3. write down the composite ft; 0 tg with which A, Ag is associated. 


The proof also shows that if ¢ is a projective transformation with an associ- 

ated matrix A, then r~! is a projective transformation with associated matrix 

A7~!. We therefore have the following strategy for calculating the inverse of a See Appendix 2 for one 
projective transformation. method to calculate A7!. 
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Strategy To find the inverse of a projective transformation f: 


1. write down a matrix A associated with fr; 
2. calculate AS. 
3. write down the inverse t~! with which A7! is associated. 


Example 2 Let f; and f2 be projective transformations defined by 
ty: [x,y,z] [x +z,x + y + 3z, -2x + Zz], 
to: [x,y,z] 2x,x+y+z,4x 4+ 2y]. 
Determine the projective transformations f2 o ft; and ft, - 


Solution The transformations ft; and f2 have associated matrices 


1 0 1 2 0 0 
Ay = 1 1 3 and As=|{1 1 1 f, 
—2 0 1 4 2 0 
respectively. It follows that t2 o t; has an associated matrix 
2 0 0 1 0.4 2 0 2 
AoA; =] 1 1 1 1 1 37=]0 1 SY, 
4 2 0 —2 0 1 6 2 10 


Nie) 


toot, : [x,y,z] [2x + 2z, y + 5z, 6x + 2y + 10z]. 


Next, ft, ' has an associated matrix Ay! given by 


1 1 
3 0-3 
-l_ 7 = is 
Ay =|-3 1 -3): 
2 1 
3 0 3 


a simpler matrix associated with ft, "is then 


1 0 -1 
—7 3 -2], 
2 0 1 


so 


=] 


ty : [x,y,z] > [x — z,-7x + 3y — 2z,2x 4+ Zz]. 


Problem 4 Let r; and fz be projective transformations defined by 
tilyy,z]i 2x t+y,-x+z,y +2], 
to: [x,y,z] > [5x + By, 3x + Sy, 2z]. 


Determine the projective transformations to t2 and f)—~ y 
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Having shown that the set of projective transformations forms a group 
under composition of functions, we can now define projective geometry 
to be the study of those properties of figures in RP* that are preserved by 
projective transformations. Those properties that are preserved by projective 
transformations are known as projective properties. 


3.3.2 Some Properties of Projective Transformations 


We now check two important properties of projective transformations, namely, 
that they preserve collinearity and incidence. 

A Line in RP? is a plane in R? that passes through the origin. It therefore 
consists of the set of points (x, y, z) of R? that satisfy an equation of the form 

ax + by+cz=0, 
where a,b and c are not all zero. We can write this condition equivalently 
in the matrix form Lx=0, where L is the non-zero row matrix (a b c) and 
x=(xy zl. 

Now let t be a projective transformation defined by ft : [x] +> [Ax], where 
A is an invertible 3 x 3 matrix, and let [x] be an arbitrary Point on the Line 
Lx = 0. Then the image of [x] under ¢ is a Point [x’] where x’ = Ax. Since 
x satisfies the equation Lx = 0, it follows that x’ satisfies L(A! x’) =0, or 
(LA~!) x’ = 0. Dropping the dash, we conclude that the image of the Line 
Lx = 0 under ¢ is the Line with equation Here, LA~! is non-zero, 

(La-!) x= 0. for if 
LA~!=0, 
Since the image of a Line in RP? is a Line, it follows that collinearity is then 
preserved under a projective transformation. 

Notice that if B is any matrix associated with t— ! then B =AA7! for some 
non-zero real number A, and so (LA~!)x = 0 if and only if (LB)x = 0. It =—L (a-'a) =L, 
follows that the image of the Line can equally well be written as (LB)x = 0. ae 
(For instance, since A~! = adj(A) / det(A) so that t~! also has adj(A) as an which is not the case. 
associated matrix, we can express the image of the Line as (L adj(A))x = 0.) 

We therefore summarize the above discussion in the form of a strategy, as 
follows. 


o= (LAv')a 


Strategy To find the image of a Line 
ax+by+cz=0 
under a projective transformation ¢ : [x] +> [AX] : 


1. write the equation of the Line in the form Lx = 0, where L is the matrix 
(abc); 

2. find a matrix B associated with t—!; 

3. write down the equation of the image as (LB)x = 0. 


Example 3 Find the image of the Line 2x + y — 3z = 0 under the projective 
transformation ft; defined by 
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ti: byy,z]he [x +z,x + y+ 3z,—-2x 4+ Zz]. 


Solution The equation of the Line can be written in the form Lx = 0, where 
L=(2 1 —3). 


In Example 2 we showed that t;~! has an associated matrix 


1 0: =1 
B=|]-7 3 -2 
2 0 1 


So 


1 
Ihe 1=35)) 01st 3=7, 
a 4 


It follows that the required image has equation 


—llx+3y—7z=0. 


Problem 5 Find the image of the Line x + 2y — z = 0 under the 
projective transformation ¢; defined by 
Fy tee) Be ya ay el. 
Next, we consider the incidence property. If two Lines intersect at the Point 
P, then P lies on both Lines. So if t¢ is a projective transformation, then t(P) 
lies on the images of both Lines. It follows that the image under ¢ of the Point 
of intersection of the two Lines is the Point of intersection of the images of 
the two Lines. In other words, incidence is also preserved under a projective, 
transformation. 


P t(P) 


fo (hy) (La) 
Theorem 2 Collinearity and incidence are both projective properties. 


3.3.3 Fundamental Theorem of Projective Geometry 


In Chapter 2 we discussed the Fundamental Theorem of Affine Geometry : ; 
; : 3 : 2 : unique affine 
which states that given any two sets of three non-collinear points of R~ there is ianeformation 
a unique affine transformation which maps the points in one set to the cor- A ae a 
responding points in the other set. So an affine transformation is uniquely ae Cc / 
determined by its effect on any given triangle. B ; C’ 
In this subsection we explore an analogous result for projective geometry 
known as the Fundamental Theorem of Projective Geometry. We begin by 
asking you to tackle the following problem. 


Problem 6 Let ¢ and f% be the projective transformations with 
associated matrices 
—4 -1 1 -8 -6 -2 
A;j=|{-3 —-2 1} and Ap=] -3 4 ra ie 
4 2 -1 6 O -4 
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respectively. Find the images of the Points [1, 
{[—1,2, —1] under t and fp. 


= i, 18 Oy aad 


You should have found that both of the projective transformations f; and 
t2 map the Points [1, —1, 1], [1, —2, 2] and [—1, 2, —1] to the Points [—2, 0, 1], 
[0, 3, —2] and [1, —2, 1], respectively. Notice, however, that t; and f2 are not the 
same projective transformation, since their matrices are not multiples of each 
other. It follows that, unlike affine transformations, projective transformations 
are not uniquely determined by their effect on three (non-collinear) Points. 

This raises the question as to whether it is possible to specify how many 
Points are required to determine a projective transformation. According to 
the Fundamental Theorem of Projective Geometry, the answer is four. In 
fact the theorem states that given any two sets of four Points, no three of 
which are collinear, there is a unique projective transformation that maps the 
Points in one set to the corresponding Points in the second set. Thus, in pro- 
jective geometry a transformation is uniquely determined by its effect on a 
quadrilateral. 

To understand why a triangle is insufficient to determine a projective trans- 
formation uniquely, consider what happens when we look for a projective 
transformation that maps the triangle of reference to three given non-collinear 
Points. 


Example 4 Find a projective transformation ¢ that maps the Points [1, 0, 0], 
[O, 1, O] and [0, 0, 1] to the non-collinear Points [1,—1,1],[1,—2,2] and 
[—1, 2, —1], respectively. 


Solution Let A be a matrix associated with f, and let the first column of A 


a 
be | b |. Then since 
c 
a * 1 a 1 
b x 0 =— b = =] > 
cx 0 c 1 
1 
it follows that we may take | —1 | as the first column of A. 
1 
Similarly, since 
d 0 d 1 
e 1 = e = —2 
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unique projective 
transformation 
A am™ A’ D' 


[\ 


Recall that the Points 

[1, 0, 0], [0, 1, 0] and [0, 
0, 1] are known as the 
triangle of reference. 


Here the asterisks * 
denote unspecified 
numbers. 
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and 
* ok g 0 8g =i 
x * A = h = 2 ; 
x «x k 1 k —l 
it follows that a suitable transformation is given by ¢ : [x] > [Ax] where Notice that because the 
Points [1, —1, 1], 
Db od = [1, —2, 2] and [—1, 2, —1] 
A=]|-l —2 2]. are not collinear it follows 
] 2 -1 that the columns of A are 


linearly independent, so 
that A is invertible. 


This example illustrates the fact that we can always find a projective 
transformation f : [x] t+» [Ax] which maps the triangle of reference to 
three non-collinear Points simply by writing the homogeneous coordinates 
of the Points as the columns of A. Notice, however, that the transforma- 
tion we obtain is not unique. Indeed, if the Points [1,—1, 1], [1,-—-2,2] and 
[—1,2,—1] in Example 4 are rewritten in the form [u, —u, u], [v, —2v, 2v] and 
{[—w, 2w,—w], for some non-zero real numbers u, v, w, then the matrix 


becomes 
u v —w 
A=|-u —-2v 2w 
u 200 -w 


The corresponding transformation f : [x] +> [Ax] still maps the triangle of 
reference to the Points [1, —1, 1], [1, —2,2] and [—1, 2, —1], as required, but 
the effect that t has on the other Points of RP? depends on the numbers uw, v 
and w. 

So if we wish to specify ¢ uniquely we need to assign particular values to u, 


v and w. We can do this by specifying the effect that t has on a fourth Point Recall that the Point [1, 1, 
1, 4.1): 1] is known as the unit 
Point. 


Example 5 Find the projective transformation t which maps the Points 
[1, 0, 0], [0, 1, 0], [0, 0, 1] and [1, 1, 1] to the Points [1,—1, 1], [1, —2, 2], 
[—1,2,—1] and [0, 1, 2], respectively. 


Solution If A is the matrix associated with ft, then its columns must be 
multiples of the homogeneous coordinates [1,—1, 1], [1, —2,2],[—1,2, —1]; 
that is, 
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Also, to ensure that t maps [1, 1, 1] to [0, 1, 2] we must choose u, v and w 
so that 


u v —w 1 0 
—u —2v 2w 1 — il 
u 2v0 —w 1 2 


We can do this by solving the equations 


utv—w=0, 

—u—2v+2w=l1, 

u+2vu—w=2. 
Adding the second and third equations we obtain w = 3. If we then subtract 
the first equation from the third we obtain v = 2. Finally, if we substitute 


v and w into the first equation we obtain u = 1. The required projective 
transformation is therefore given by f : [x] + [Ax], where 


1 2 —3 
A=|-1 -4 6 
1 4 —-3 
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The columns of A are still 
linearly independent 
because they are non-zero 
multiples of the linearly 
independent vectors 
d,—1,1), (1, —2,2) and 


It is natural to ask whether the method used in this example can be adapted to 
find a projective transformation which maps the triangle of reference and unit 
Point to any four given Points. The answer is usually yes, but since collinearity 
is a projective property, and since no three of the Points [1, 0, 0], [0, 1, 0], [0, 
0, 1], [1, 1, 1] are collinear, the method must fail if three of the four given 
Points lie on a Line. Provided we exclude this possibility, the answer is yes! 


Strategy To find the projective transformation which maps 


[1,0,0] to [a1, a2, a3], 
[0, 1,0] to [by, bo, b3], 
KO, @; IU} 1 [ey C2, Call, 
[1,1,1] to [dj, do, ds], 


where no three of 
collinear: 


[a1, 42, 43], [b1, bz, 63], [c1, c2, c3], [d1,d2,d3] are 


1. find u, v, w such that 


aquu bv cw 1 d\ 
anu bov cow il |) = || @ |/8 
a3u b3v c3w 1 d3 


(17.21), 


We explain why the 
method works in the 
Remark that follows the 
strategy. 
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2. write down the required projective transformation in the form ¢ : [x] > 
[Ax], where A is any non-zero real multiple of the matrix 


qu byv Cc, W 
anu bou cC2W 
anu b3v c3w 


Remark 


To see why this strategy always works, notice that we can rewrite the equation 
from Step | in the form 


ay by al d\ 
ul aa}tuvilo]t+wla}l= {a 
a b3 C3 d3 


From this we can make the following observations. 


(a) The equation in Step | must have a unique solution for u,v, w because 
the required values of u,v and w are simply the coordinates of (dj, d2, d3) 
with respect to the basis of R* formed from the three linearly independent 
vectors (a1, 42, 43), (bj, b2, b3), (C1, €2, €3). 

(b) The values of u, v and w must all be non-zero, because otherwise 

three of the vectors (a1, a2, 43), (by, bz, b3), (cy, C2, €3), (d1, dz, d3) would 

be linearly dependent. 

Since the columns of A are non-zero, multiples of the linearly independent 

vectors (a1, a2, a3), (b1, b2, b3), (c1, C2, €3) it follows that A is invertible, 

and hence that ¢ is a projective transformation. 


(c 


wm 


There is no need to check whether any three of the four given Points are 
collinear because any failure of this condition will emerge in the process of 
applying the strategy. Indeed, if the equation in Step 1 fails to yield unique 
non-zero values for u,v and w, then it must be because three of the Points 
(a1, 42, 43), (b1, b2, b3), (c1, C2, €3), (dj, dz, d3) lie on a Line. 


Problem 7 Use the above strategy to find the projective transforma- 
tion which maps the Points [1, 0, 0], [0, 1, 0], [0, 0, 1] and [1, 1, 1] to 
the Points: 


(a) [—1,0, 0], [—3, 2, 0], [2, 0, 4] and [1, 2, —5], respectively; 
(b) [1, 0, 0], [0, 0, 1], [0, 1, 0] and [3, 4, 5], respectively; 
(c) [2, 1, 0], [1,0, —1], [0, 3, —1] and [3, —1, 2], respectively. 


Now consider the transformation t; in Problem 7(a). The inverse of this, 
t a is a projective transformation which maps the Points [—1, 0, 0], [—3, 2, 0], 
(2, 0, 4] and [1,2,—5] back to the triangle of reference and unit Point. So 
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The non-zero multiple can 
be used to clear fractions 
from the entries of the 
matrix A. 


Since no three of the 
Points [a1, a2, 43], 

[b1, bz, 63], [e1,€2,¢3], 
[d1, dz, d3] are collinear, it 
follows that any three of 
the vectors (a1, a2,q3), 
(bj, b2, 63), (c1,¢2,€3), 
(d, dz, d3) must be 
linearly independent. 
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if, after applying this inverse, we apply the projective transformation f2 in 
Problem 7(c), then the overall effect of the composite f2 of," ‘is that of a projec- 
tive transformation which sends the Points [—1, 0, 0], [—3, 2, 0], [2, 0, 4] and 
[1,2,—5] directly to the Points [2, 1, 0], [1,0,—1], [0,3, —1] and [3, —1, 2], 
respectively. 


tot! 


* [-1,0,0] + [2,1,0] 


0 [-3,2,0] @ [1,2,-5] 0 [1,0,-1] e [3,-1,2] 


*[1,0,0] 


x [0,0,1] 


In a similar way we can find a projective transformation which maps any set 
of four Points to any other set of four Points. The only constraint is that no 
three of the Points in either set can be collinear. In the following statement of 
the Fundamental Theorem we express this constraint by requiring that each of A 
the four sets of Points lie at the vertices of some quadrilateral, where a quadri- 


B 
lateral is defined as follows. A quadrilateral is a set of four Points A, B,C 
and D (no three of which are collinear), together with the Lines AB, BC, CD D 
and DA. Cc 
Theorem 3 The Fundamental Theorem of Projective Geometry unique projective 


transformation 


plo i 4 1 2 9 
Let ABCD and A’B’C'D' be two quadrilaterals in RP~. Then: Ayo A’ pt 


(a) there is a projective transformation t which maps # CN a 
D 
B' C 


Ato A’, B to B’,C toC’, D to D’; c 


(b) the projective transformation ¢ is unique. 


Proof According to the strategy above, there is a projective transformation 
t; which maps the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] to the 
Points A, B,C, D, respectively. Similarly, there is a projective transformation 
t2 which maps the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] to the Points 
A’, B’,C’, D’, respectively. 
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* [1,0,0] 


© [0,1,0] @ [1,1,1] 


x [0,0,1] 


(a) The composite t = tno ft) ‘is thena projective transformation which maps 
Ato A’, B to B’, CtoC’, Dto D’. 

(b) To check uniqueness of ¢, we first check that the identity transforma- 
tion is the only projective transformation which maps each of the Points 
[1, 0, OJ, [0, 1, OJ, [0, 0, 1], [1, 1, 1] to themselves. In fact any projective 
transformation with this property must have an associated matrix which is 
some non-zero multiple of the matrix 


u 0 O u 0 0 ih 1 
O v Oj], where | 0 v O 1}/=]{1 
0 0 w 0 0 w if 1 


Such a matrix must be (a non-zero multiple of) the identity matrix, and so 
the transformation must indeed be the identity. 


Next suppose that ¢ and ft’ are two projective transformations which satisfy 
the conditions of the theorem. Then the composites 1, ‘oO tot and ty lot’ 
o t, must both be projective transformations which map each of the Points 
[1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] to themselves. Since this implies that both 
composites are equal to the identity, we deduce that 


i ored =t,' ot’ ot, 
If we now compose both sides of this equation with fz on the left and with t, : 


on the right, then we obtain ¢ = t’, as required. |_| 


The Fundamental Theorem tells us that there is a projective transformation 
which maps any given quadrilateral onto any other given quadrilateral. So we 
have the following corollary. 


Corollary All quadrilaterals are projective-congruent. 


If we actually need to find the projective transformation which maps one 
given quadrilateral onto another given quadrilateral, we simply follow the 
strategy used to prove part (a) of the Fundamental Theorem. 
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This follows from the 
discussion leading to the 
strategy above. 


By projective-congruent 
we mean that there is a 
projective transformation 
that maps any 
quadrilateral onto any 
other quadrilateral. 
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Strategy To determine the projective transformation t which maps the 

vertices of the quadrilateral ABCD to the corresponding vertices of the 

quadrilateral A’ B’C’ D’: . a 

unique projective 

1. find the projective transformation t; which maps the triangle of reference papeouass . 
and unit Point to the Points A, B, C, D, respectively; 7 [\ 

2. find the projective transformation ¢2 which maps the triangle of reference D 
and unit Point to the Points A’, B’, C’, D’, respectively; c 


3. calculate t = fp 0 ee 


Example 6 Find the projective transformation t which maps the Points 
[1, —1, 2], [1, —2, 1], [5, -1, 2], [1, 0, 1] to the Points [—1, 3, —2], [—3, 
Ts 5], [2, 3; 4], [ 3, 8, 5], respectively. 


Solution We follow the steps in the above strategy. 


(a) Any matrix associated with the projective transformation t; which maps 
the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] to the Points [1,—1, 2], 
[1,—2, 1], [5,—1,2], [1, 0, 1], respectively, must be a multiple of the 


matrix 
u v Sw u v Sw 1 1 
—u —2v —w], where] —u —2v —w 1}/=]0 
2u v 2w 2u v 2w 1 1 
Solving the equations 
u+v+5w=1, 
—u—2v—w=0, 
2u+v+2w =, 
we obtain u = 7 v=-— 7 w= é: So a suitable choice of matrix for f, 1s 
1 1 5 is Si 7 i 
5 7-3 z 3 2 5 It is simpler to multiply 
1 2 1 er oe eae 3 4 1 the first matrix by 6 to 
~2 3 —~6 |? OF more simply Al = | — — obtain integer entries. 
1 —} ; 6 —2 2 This does not alter the 


projective transformation 
t, with which the matrix 


(b) Any matrix associated with the projective transformation f2 which maps. : 
is associated. 


the Points [1, 0, O], [0, 1, O], [0, 0, 1], [1, 1, 1] to the Points 
[—1, 3, —2], [—3, 7, —5], [2, —5, 4], [—3, 8, —5], respectively, must be a 
multiple of the matrix 


—u —3v 2w —u —3v 2w 1 —3 
3u 7u —5w |, where 3u Tv —Sw lj= 8 
—2u —5v 4w —2u —S5v 4w 1 —5 
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Solving the equations 
—u —3v+2w = -3, 
3u +7 —5w = 8, 
—2u —5v+4w =—-5, 


we obtain u = 2,v = 1, w = 1. Soa suitable choice of matrix for fz is 


FS. @ 
A={| 6 7 —5 
=4 =f 4 


(c) A matrix associated with the inverse, ft, a of t; is AG" which we can 
calculate to be 


al I 
12 12 4 
=1 1 1 * 
Aj => 0 3 6 > 
to lt el 
4 12 2 


1 -1 -3 
B=] 0 -4 -2 
—3 -1 1 
The required projective transformation is therefore t : [x] / [Ax], 
where 
—2 -3 2 1 -1 -3 
A= AoB= 6 7 —-5 0 -4 -2 
—4 —-5 4 —3 -l 1 
—8 12 14 
= 21 —29 —37 
—16 20 26 


Problem 8 Find the projective transformation ¢ that maps the Points 
[—1, 0, 0], [—3, 2, 0], [2, 0, 4], [1, 2, —5] to the Points [2, 1, 0], [1,0, —1], 
[0,3, —1], [3, —1, 2], respectively. 


Problem 9 Find the projective transformation t that maps the Points 
[1,0, —3], [1,1,-—2], [3,3,—5], [6,4,—13] to the Points [3,—5,3], 
[5.—1, 0], [3, —5, 6], [8, —13, 12], respectively. 


3.3.4 Geometrical Interpretation of Projective Transformations Nouanay combing 


subsection at a first 
In this subsection we discuss the relationship between projective transforma- reading, as it is quite hard 


tions and the perspectivities introduced in Section 3.1. going. 
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Starting from the geometric definition of perspective in Subsection 3.1.1, 
we will define the term perspective transformation, show how a perspective 
transformation may be interpreted as a projective transformation, and finally 
prove that any projective transformation can be expressed as the composite of 
at most three perspective transformations. 

So, let x and zr’ be two embedding planes in R°? that do not pass through the 
origin O in R?, and let C (4 O) be another point in R? such that OC is not 
parallel to either z or z’. Let C have position vector ¢ (based at O). Also, let 
o denote an arbitrary perspectivity from the point C that maps the plane z to 
the plane zr’. 


Now, the perspectivity o will map any point P (with position vector p) in 
onto some point P’ (with position vector p’) in z’, so long as the vector p — ¢ 
is not parallel to the plane zr’. We then define the perspective transformation 
associated with o to be the mapping of R? to itself that maps the line [p — ¢] 
onto the line [p’ — c]. But since C, P and P’ are collinear, it follows that the 
vectors p—c¢ and p’ — c must be parallel (equivalently, that [p — ce] = [p’ — ¢]); 
hence there is some real number ¢ such that 


p'—c=t(p—c). 
We can then rewrite this formula in the form 
p =1(p—c)+¢, 
or 
p =tp+ (1—2‘)e, 
so that 
[p] +> [tp + 1 — nel]. 


In this way, the perspectivity o gives a one-one mapping from z onto zr’, 
except that it is not defined on the line @ in 2 where z cuts the plane through 
C parallel to z’; also, there are no points of z that map onto points of z’ on 
the line ¢’ where z’ cuts the plane through C parallel to 7. We use the fact that 
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Recall that a projective 
transformation of RE2 is 
a map [x] — [Ax], where 
A is an invertible 3 x 3 
matrix. 


Note that O, C, P and P’ 
are coplanar. 


Recall that a line through 
the origin in R? is a 
1-dimensional vector 
space. 

The value of ¢ will depend 
on the particular point P 
under discussion. 
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the ideal Points for a plane correspond to the directions of lines in the plane, 
rather than actual points in the plane, to extend our definition of the map. 


So, first, let P be a point of z that lies on the line @; then the points O, C, P 
are not collinear, since OC is not parallel to the plane zr’. Denote the position 
vector OP by p. Let the plane through O, C and P meet the plane z’ in a line, 
and let p’ be a position vector based at O that is parallel to this line. Then we 
specify that our (extended) map o maps the line [p] through O onto the line 
[p’] through O. 

Similarly, let P’ be a point of zr’ that lies on the line @’; then the points O, 
C, P’ are not collinear, since OC is not parallel to the plane 2. Denote the 
position vector OP’ by p’. Let the plane through O, C and P’ meet the plane 
z ina line, and let p be a position vector based at O that is parallel to this line. 
Then we specify that our (further extended) map o maps the line [p] through 
O onto the line [p’] through O. 

Finally, we specify that the extended map o maps the line through O that is 
parallel to the line of intersection of z and z’ onto itself. 

In this way, we have constructed a transformation that is a one-one mapping 
of z U {the ideal Points for 7} onto x’ U {the ideal Points for zr’ | associated 
in a natural way with the given perspectivity o, and we call it the associated 
perspective transformation. This maps the family of Euclidean lines through 
O onto itself, in other words RP? onto itself. 


We now explain why we can think of this perspective transformation as a 
projective transformation. 
First, we consider a perspectivity in R? as this will prove useful later in our —_ In our discussion we shall 


discussion. So, let £ and ¢’ be two lines in R? that do not pass through the omit discussion of ‘the 
exceptional points’, for 
simplicity. 


origin O in R?, let L be the common point of the two lines, and let C(# O) 
be another point in R? such that OC is not parallel to either ¢ or ¢’. Consider 
any perspectivity 0, with centre some point C, say, that maps @ ‘onto’ ¢’. 

We are interested in the map Tt that sends lines through O to lines through O 
that is obtained from o as follows. Let o map the point P on € onto the point 
P’ on £’. Then we define t(OP) = OP’. 
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It is clear that t(OL) = OL, because L is fixed by o. 

It is also clear that t(OC) = OC, because if the line OC meets @ at P and 
the line ¢’ at P’ then O, C, P and P’ are collinear and so t(OC) = OC. 

We therefore > choose to take as basis vectors in R? the vector e = OL and 
the vector ce = OC. 

We now find the effect of the map t on a line through the origin O. We shall 
suppose that the line OC meets the line ¢ at the point with position vector ke 
and the line ¢’ at the point with position vector k’c. Any point P on the line @ 
then has position vector te + (1 — t)ke, for some real number f¢, and any point 
P’ on the line ¢’ has position vector se + (1 — s)k’c, for some real number s. on 

Next, the line OP consists of the points with coordinates u(e + mc) = ue+ O 
muc for a fixed value of m and varying values of u. This gives two (equivalent) 
expressions for the position of the point P, namely te+(1 — t)ke and we+mue. 

It follows that we must have 


u=t and mu=(1—f)k. 


Dividing the second equation by the first, we get 


1-t 
m= ——k 
t 
so that 
tm = k — kt, 
which yields the formula 
k 
t= ——_. 
m+k 


Similarly, the line OP’ consisting of points u'(e + m'c) = u’e + m'u'e, for a 
fixed value of m’ and varying values of u’, meets the line @’ at the point P’ 
where 
k/ 
om! +k 
Now, we have t(OP) = OP’ if and only if the points C, P and P’ are collinear; 
that is, if and only if there is a real number r such that 


\ 
\ 


< 
u(e+m c) 


r(te+ (l1—nke—c) =se+(1 —s)k'c—e. 


This is the case if and only if mee 


u(et+m’c) ~ 
rt=s and r(1—ft)k—-1)=(—s)k'-1. Oo 
We can eliminate r by dividing the second equation by the first equation, so 
obtaining 
Gd=O)k=1 (=k =1 


t S 
Then, if we substitute t = k_ands = _k into this equation, after some 
m+k m'+k 
manipulation we obtain the remarkable result that 
; (k — 1)k’ 
m=m 


(k’ — Dk’ 
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It follows that, in terms of the basis elements e and c, the map Tt of the family 
of lines through O to itself given by 


Cn) LO) 


can be represented by the matrix 


1 0 
(k-Dk’ | - 
0 (klk 


For convenience, we write this matrix in the form 


(i +) 


where r is a fixed number that depends only on the geometry of the two lines 
é and ¢’ and on our choice of the point C. Furthermore, r 4 0 since k 4 1 
(for k = | implies that C lies on 7) and k’ 4 0 (for k’ = 0 implies that O lies 
on zr’). 

Now we consider the situation in R3, in relation to RP?. Consider an arbi- 
trary perspectivity o, with centre some point C, say, that maps a plane z ‘onto’ 
a plane zc’, with neither O nor C lying on z or z’. As before, we shall suppose 
that the line OC meets the plane z at a point with position vector ke and the 
plane zr’ at a point with position vector k’c. 

Once again, we are interested in the map t that sends lines (in R*) through 
O to lines through O that is obtained from o as follows. Let o map a point P 
in z toa point P’ in x’. Then we define t(OP) = OP’. 

Let E,; and E> be any two points on the common line @ of the two planes. 
Then, clearly, T(OE,) = OE, and t(OE2) = OE? because every point of £ is 
fixed by o. 

It is also clear that t(OC) = OC, because if the line OC meets the plane 
x at Q and the plane z’ at Q’, then O, C, Q and Q’ are collinear, and so 
t(OC) = OC. 

We therefore now take as basis vectors in R* the vectors e; = OF, 
e2 = OEp, and ¢ = OC. 

We now find the effect of t on a line in R* through the origin O. We can 
simplify our task by observing that the lines OC, OP and OP’ all lie in a plane, 
x" say; let this plane meet the line common to the given planes z and z’ ata 
point EF. We then define e = OE. 

We can now apply our earlier discussion of the planar case to the restriction 
of the mapping Tt to the plane 7”. If we denote the lines OP and OP’ by the 
parametrizations 


OP = u(e+ me) = ue+ume,_ where m is fixed and u varies, 
OP’ =u'(e+m'c) =u’e+u'm'c, where m’ is fixed and u’ varies, 


it follows from our earlier discussion that m’ can be expressed in the form 
rm, where r is a fixed number that depends only on the geometry of the two 
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planes z and z’ and on our choice of the point C. Hence we can rewrite the 
parametrization of OP’ as 


OP’ =u (e+rme) =u'e+u'rme, where r, mare fixed and uw’ varies. 
Next, we can express the position vector e of the point E in the form 
e=fte;+(1—t)eo, forsome number ft. 
Then we have that the mapping t maps the line 
OP = u(te; + (1 — toes + me) = ute; + u(1 — theo + ume 
onto the line 
OP’ = u'(te; + (1 — Heo + rme) = w'te; + u'(1 — then +. u're. 


It follows analogously to the two-dimensional situation that, in terms of the 
basis elements e;, e2, c, the mapping t of the family of lines in R® to itself 
can be represented by the matrix 


Notice that this maps the 
line represented by the 
points [t, 1 — r, 0] to itself 
pointwise, as it should. 


1 0 0 
0 1 Of, 
0 Or 


where the (non-zero) constant r depends only on the position of the planes zr 
and zc’. Since this is an invertible 3 x 3 matrix, this is our required description 
of the perspective transformation as a projective transformation. 

We now go the other way round, and obtain a projective transformation as 
a sequence of three perspective transformations. We have a lot of freedom, 
because we can choose the centres of perspectivities and the planes. 

Suppose we are given a projective transformation t and two embedding 
planes z and z’. Let [a], [b], [ec], [d] be any four non-collinear Points, and 
[a’], [b’], [c’], [d’] be any other four non-collinear Points; we can represent 
these Points by (Euclidean) points A, B, C, D in z and A’, B’, C’, D’ inz’, 
respectively. 

We will use in our discussion the existence and uniqueness parts of the Fun- — Subsection 3.3.3, 
damental Theorem of Projective Geometry: namely, that there is one and only Theorem 3 
one projective transformation mapping four non-collinear Points to any other 
four non-collinear Points. This means that if we can find a composite of three 
perspective transformations that maps [a], [b], [ec], [d] to [a’], [b’], [e’], [d’], 
respectively, then this composite must be a projective transformation (by the 
existence part) and it must equal the given projective transformation t (by the 
uniqueness part). 

We now exhibit a sequence of three perspectivities the composite of which 
maps A, B, C, Dinz to A’, B’, C’, D’ in’. In our discussion it is convenient 
to let AB and CD meet at E, AC and BD meet at F’, and AD and BC meet 
at G, with analogous definitions of E’, F’ and G’. (You may find the figures 


Projective Transformations 


below helpful to follow through the argument; though, for simplicity, we have 
omitted the initial plane z and the points F’, F” and F’”.) 


1. The first perspectivity is from the plane z to a plane z” that passes through 
A’. The centre of this perspectivity is an arbitrary point P; on the line AA’ 
(if A = A’, then P; can be chosen to lie anywhere not on z or z’). This 
perspectivity maps A to A’ and B, C, D, E, F Gto B”, C", D", E", F", G", 
say, respectively. We can assume, by suitably varying 2”, that B’B” and 
E’E" are not parallel. 


2. Now, by the definition of E’ the points A’, B’, E’ lie on a line through 
A’, and since the points A, B, E are collinear the points A’, B”’, E” lie 
on a line through A’; so these five points lie in a plane, and because the 
lines B’B” and E’E” are not parallel they meet in a point, P2 say. We then 
pick a plane z”” through the line A’B’E’ and map the plane z” onto x” 
by the perspectivity with centre P). This sends A’, B”, E” to A’, B’, E’, 
respectively, and the points C’, D”, F”, G” to, say, C’”, D’”, F’”, G", 
respectively. As before, we can assume that D’D’” and G’G”’ are not 
parallel. 

3. Now, since A, D, G are collinear, the points A’, D’, G’ lie on a line through 
A’ and the points A’, D’”’, G’” lie on a line through A’, so these five points 
lie in a plane; and because the lines D’D’” and G’G” are not parallel they 
meet in a point, P3 say. We then map the plane z’” onto the plane x’ by 
the perspectivity with centre P3. This map sends A’, B’, D’”’, E””, G’” to 
A’, B’, D’, E’, G’, respectively. 

This third perspectivity sends the line B’G”” to the line B’G’ and the line 
DE’ to the line D’E’, so it maps the point C’” to the point C’. 


Thus the composite of these three perspectivities maps the points 
A, B, C, Dinz to A’, B’, C’, D’ inz’, as required. 
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Theorem 4 _  Perspectivity Theorem 
Every projective transformation can be expressed as the composite of three 
perspective transformations. 


3.4 __ Using the Fundamental Theorem of Projective Geometry 


In Section 3.2 we described how an embedding plane z can be used to rep- 
resent projective space RP*. The Points of RP? are represented by Euclidean 
points in and the Lines of RP? are represented by Euclidean lines in z. 

In general, any Euclidean figure in an embedding plane corresponds to 
a projective figure in RP*, and visa versa. This correspondence enables us 
to compare Euclidean theorems about a figure in an embedding plane with 
projective theorems about the corresponding projective figure. Provided that 
the theorems are concerned exclusively with projective properties, such as 
collinearity and incidence, then a Euclidean theorem will hold if and only if 
the corresponding projective theorem holds. 


3.4.1 Desargues' Theorem and Pappus’ Theorem 


The advantage of interpreting a Euclidean theorem as a projective theorem 
in this way is that we can often obtain a much simpler proof of the theorem 
than would be possible using Euclidean geometry directly. We illustrate this 
by using projective geometry to prove the theorem of Desargues. 


The Euclidean figure may 
have some ideal Points 
attached to it. 


We introduced Desargues’ 
Theorem in Subsection 
3.1.3. 


Using the Fundamental Theorem of Projective Geometry 


Theorem 1 Desargues’ Theorem 

Let AABC and AA’ B’C’ be triangles in R? such that the lines AA’, BB’ and 
CC’ meet at a point U. Let BC and B’C’ meet at P, CA and C’A’ meet at 
Q, and AB and A’ B’ meet at R. Then P, Q and R are collinear. 


Q 


Proof Because this theorem is concerned exclusively with the projective 
properties of collinearity and incidence we can interpret it as a projective the- 
orem in RP*. Moreover, by the Fundamental Theorem of Projective Geometry 
we know that any configuration of the theorem is projective-congruent to a con- 
figuration of the theorem in which A = [1,0,0], B = [0,1,0], C = [0,0, 1] 
and U = [1,1,1]. If we can prove the theorem in this special case then we 
can use the fact that projective-congruence preserves projective properties to 
deduce that the theorem holds in general. 

To prove the special case we use the algebraic techniques described in 
Section 3.2. First observe that the Line AU passes through the Points [1, 0, 0] 
and [1, 1, 1], and therefore has equation y = z. Since A’ is a Point on AU, it 
must have homogeneous coordinates of the form [a, b, b], for some real num- 
bers a and b. Now, b 4 0, since A # A’; so we may write the homogeneous 
coordinates of A’ in the form [p, 1, 1] (where p = a/b). 

Similarly, the homogeneous coordinates of the Points B’ and C’ may be 
written in the form [1, gq, 1] and [1, 1,7], respectively, for some real numbers q 
and r. 

We now find the Point P where BC and B’C’ intersect. The Line BC has 
equation x = 0. Since the Line B’C’ passes through the Points B’ = [1, qg, 1] 
and C’ = [1, 1, r], it must have equation 
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For [a, 0,0] = [1, 0, 0]. 


We omit the details of the 
calculations. 
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which we may rewrite in the form 
(qr —I)x-(r-ly+Q-q)z=0. 


It follows that at the Point P of intersection of the Lines BC and B/C’ we 
must have x = 0 and (r—1)y = (1—q)z, so that P has homogeneous coordinates 


[0,1 —q,r—1]. 
Similarly, the Points Q and R have homogeneous coordinates [1 — p, We omit the details of the 
0,r — 1] and [1 — p,q — 1,0], respectively. calculations. 


Now, the Points P, Q and R are collinear if 


0 l1-—q r-1l 
1—p 0 r=1) = 0: 
1—p q-1 0 


But 


0 l1-—q r-1l 
1—p 0 r—1 
l1—p q-1 0 


a, Si l1-—p r-l ee 0 
te of, 0 +o ofa? mr 
==—(l=—@)(l=pd=n+¢=-)0d=p@=) 
= 0, 


It follows that P, Q and R are collinear, as asserted. The general result now 
holds, by projective-congruence. | 


When using the Fundamental Theorem to simplify proofs of results in pro- 
jective geometry, we do not usually refer to projective-congruence. Instead, so 
long as the properties involved are projective properties, we content ourselves 
with an initial remark of the type: ‘By the Fundamental Theorem of Projective 
Geometry, we may choose the four Points. .., no three of which are collinear, 
to be the triangle of reference and the unit Point; that is, to have homogeneous 
coordinates [1, 0, 0], [0, 1, 0], [0, 0, 1] and [1, 1, 1], respectively’. 


Problem 1 Let AABC be atriangle in R’, and let U be any point of R* 
that is not collinear with any two of the points A, B and C. Let the lines 
AU, BU and CU meet the lines BC,CA and AB at the points A’, B’ 
and C’, respectively. Next, let the lines BC and B’C’ meet at P, AC and 
A'C’ meet at Q, and AB and A’ B’ meet at R. Prove that P, Q and R are 
collinear. 
Hint: Let A, B,C be the vertices of the triangle of reference, and let U 
be the unit Point. Then determine the homogeneous coordinates 
of the Points A’, B’ and C’. 


Using the Fundamental Theorem of Projective Geometry 


Next we use the Fundamental Theorem of Projective Geometry to prove 
Pappus’ Theorem. 


Theorem 2 Pappus’ Theorem 

Let A, B and C be three points on a line in R?, and let A’, B’ and C’ be three 
points on another line. Let BC’ and B’C meet at P,C A’ and C’A meet at 
Q, and AB’ and A’B meet at R. Then P, Q, R are collinear. 


Proof We interpret the theorem as a projective theorem, so: by the Funda- 
mental Theorem of Projective Geometry we may choose the four Points A, 
A’, P, R, no three of which are collinear, to be the triangle of reference and 
the unit Point; that is, to have homogeneous coordinates [1, 0, 0], [0, 1, 0], 
[0, 0, 1 ] and [1, 1, 1], respectively. 

First observe that the Line AR passes through the Points [1, 0, 0] and 
[1, 1, 1], and must therefore have equation y = z. Since B’ is a Point on 
AR, it must have homogeneous coordinates of the form [a, b, b] for some real 
numbers a and b. Now, b £0 since A 4 B’, so we may write the homogeneous 
coordinates of B’ in the form [r, 1, 1] (where r = a/b). 

Similarly, the Point B lies on the Line x = z through the Points A’ = 
[0, 1,0] and R = [1,1, 1], so it must have homogeneous coordinates of the 
form [1, s, 1]. 

Next we find the Point C where the Line AB intersects the Line B’ P. Since 
the Line AB passes through the Points A = [1,0,0] and B = [1,s, 1], it must 
have equation y = sz. Also since the Line B’P passes through the Points 
B’ = [r,1, 1] and P = [0,0, 1] it must have equation x = ry . At the Point C 
where AB meets B’P we have y = sz and x =ry,soC =[rs,s, 1]. 

Similarly, C’ is the point where the Line BP intersects the Line A’B’. 
Since B = [1,s5,1] and P = [0,0,1], BP has equation y = sx; and, since 
A’ = [0,1,0] and B’ = [r,1,1], A’B’ has equation x = rz. It follows that 
C’ =[r,rs, 1]. 

Finally we find the point Q where AC’ intersects A’C. Since the Line AC’ 
passes through the Points A = [1,0,0] and C’ = [r,rs,1] it must have 


L715 


This theorem is named 
after Pappus, a Greek 
mathematician who 
discovered it in the 3rd 
century AD. 
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equation y = rsz. Also the Line A’C passes through the Points A’ = [0, 1, 0] 
and C = [rs,s,1] so it must have equation x = rsz. At the Point Q where 
AC’ intersects A’C we have y = rsz and x =rsz,so Q =[rs,rs, 1]. 
To complete the proof we simply observe that the Points R = [I, 1, 1], A fortunate choice of 
OQ =[rs,rs, 1] and P = [0,0, 1] all lie on the Line x = y. It follows that P,Q _ Points for the triangle of 


and R are collinear. o reference and unit Point 
meant that we did not 


Although we can sometimes simplify the proof of a Euclidean theorem by _ have to use the 
determinant criterion for 
collinearity at the final 
stage of the argument. 


using projective geometry, there is another more subtle reason for interpreting 
a Euclidean theorem as a projective theorem. By doing so we can often avoid 
having to make special provision for exceptional cases, such as when two lines 
are parallel. In projective geometry, Lines which correspond to a pair of paral- 
lel lines in an embedding plane actually meet and are therefore no different to 
any other Lines. 

As an example, consider the diagram in the margin. This illustrates the sit- 
uation that occurs in Pappus’ Theorem when the Point of intersection R of 
A’B and AB’ is an ideal Point for the embedding plane. The above proof of 
Pappus’ Theorem is able to cope with this situation because it uses arguments 
from RP*! Our interpretation of the theorem on an embedding plane in this 
situation is that the Points P and Q must be collinear with the ideal Point R at 
which A’B and AB’ meet. That is, PQ must be parallel to both A’B and AB’. 


Problem 2_ Give a Euclidean interpretation of Desargues’ Theorem on 
an embedding plane z in the case where Q is an ideal Point for zr. 


3.4.2 Duality 


Recall that two key projective properties that we have met so far have a certain 
symmetry between them. 


Collinearity Property Incidence Property unique unique 
Line Point 


Any two distinct Points Any two distinct Lines 
lie on a unique Line. meet in a unique Point. 


We can obtain one property from the other simply by interchanging the 
words ‘Point’ and ‘Line’, and making whatever other changes are needed to 
ensure that the sentence makes sense. We say that this interchanging process 
dualizes one statement into the other, and that each statement is the dual of the 
other. 

For example, ‘a family of Points on a Line’ becomes ‘a family of Lines 
through a Point’ under dualization. Similarly, ‘a triangle’ or ‘a family of three 
non-collinear Points and the three Lines joining them’ dualizes to ‘a family of 
three non-concurrent Lines and the three Points where they meet’, which is 
again a triangle. Since a triangle is thus dual to a triangle, we say that triangles 
are self-dual figures. 
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Se CA 


The dualization process is particularly interesting when applied to theorems. 
We shall illustrate this in the context of Pappus’ Theorem. In order to do this, it 
is helpful to rephrase Pappus’ Theorem using the term hexagon. As you would 
expect, a hexagon in RP” consists of six Points joined by six Lines. The figure 
below illustrates (Euclidean) hexagons in an embedding plane; the correspond- 
ing hexagons in RP? are the corresponding six Points and six Lines — that we 
model as six lines and six planes in R?. 


OK yeh 


We can now rephrase Pappus’ Theorem in the following form. 


Theorem 3 Pappus’ Theorem (rephrased) 

Let the vertices A, B’,C, A’, B and C’ of a hexagon lie alternately on two You should compare this 
different Lines. Then the Points of intersection of opposite sides B’C and _ formulation with that in 
BC’, CA’ and C’A, AB’ and A’B, are collinear. Subsection 3.4.1, 


Al B' C' 


If we dualize this theorem, we obtain the following theorem. 
Charles J. Brianchon 
Theorem 4 _Brianchon’s Theorem (1785— 1864) was one of 
Let the sides AB’, B'C, CA’, A’B, BC’,C’A of a hexagon pass alternately = ™4@"Y distinguished 
through two (different) Points P and Q in RP*. Then the Lines joining Ereich geotuciers wie 


i ; studied under Gaspard 
opposite vertices A and A’, B and B’, C and C’, are concurrent. Monge (1746—1818). 
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Problem 3 Prove Brianchon’s Theorem. 
Hint: Let P,C, Q,C’ be the Points [1, 0, 0], (0, 1, 0], [0, 0, 1, [1, 1, U1, 


respectively. 


It turns out that the dual of any true statement concerning Points, Lines and 
their projective properties remains true after dualization; that is, if we dualize 


any theorem in projective geometry, then the statement that we obtain is itself 


a theorem. 


Problem 4 Earlier you saw that ‘three Points [a,b,c], [d,e, f], 


[g,h,k] are collinear if and only if jd e 


dual result of this statement. 


abe 
f| = 0'. Write down the 
g h_ k 


We end this subsection by forming the dual of Desargues’ Theorem, as 


follows. 


Desargues’ Theorem 


Let two triangles be such that the 
Lines joining corresponding 
vertices meet at a Point. 

Then the Points of intersection of 
the corresponding sides of the two 
triangles are collinear. 


Dual Theorem 


Let two triangles be such that the 
Points through which correspond- 
ing sides pass are collinear. 

Then the Lines through the corre- 
sponding vertices of the two 
triangles are concurrent. 


Note that the dual theorem is simply the converse result for Desargues’ The- 
orem! Thus the Principle of Duality enables us to deduce that the converse of 


Desargues’ Theorem holds. 


We do not prove this 
assertion, as it would take 
us beyond the scope of 
this book. 


Subsection 3.2.2, Strategy 


Cross-Ratio 


3.5 Cross-Ratio 


3.5.1 Another Projective Property 


Earlier, in Subsection 2.2.1, we noted that ratio of lengths along a line is an 
affine property. Thus, in affine geometry, if we are given two points P and 
Q on a line @, then we can locate the position of a third point R along @ by 
specifying the ratio PR : RQ. In particular, it is possible to talk about the 
point midway between P and Q. 

In projective geometry it is meaningless to talk about the Point midway 
between two other Points. In one embedding plane z a Point R may appear 
to be midway between the Points P and Q, whereas in another embedding 
plane z’ the ratio PR : RQ may be very different. 


This ambiguity arises from the fact that perspectivities do not preserve the 
ratio of lengths along a line, so: ratio of lengths along a line is not a projective 
property. 

In some embedding planes, such as the plane zr’ illustrated in the margin, 
the Point R does not even appear to lie between P and Q, so betweenness is 
not a projective property either! 

Fortunately, there is a quantity, known as cross-ratio, that is preserved 
under all projective transformations. To see how this is defined, consider four 
collinear Points A = [a], B = [b], C = [c], D = [d] in RP*. We can express 
the fact that A, B, C, D are collinear by writing c and d as linear combinations 
of a and b. Thus we can write 


c=aa+ fb and d=ya-+ db, 


for suitable real numbers a, f, y, 6. 
The cross-ratio is then defined to be the ratio of the ratios B and a 
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Definition Let A, B,C, D be four collinear Points in RP? represented by 
position vectors a, b, c, d, and let 


c=aa+ fb and d=ya-+ ob. 


Then the cross-ratio of A, B, C, D is Equivalently, we can write 
8/6 (ABCD) = 2. 
(ABCD) = — / -. 
af y 


Of course, before we can be sure that this definition makes sense, we must 
ensure that it does not depend on the particular choice of position vectors a, 
b, c, d that are used to represent the Points A, B,C, D. We shall check this 
shortly, but first we illustrate how cross-ratios are calculated. 


Example 1 Let A = [1,2,3], B = [1,1,2], C = [3,5,8], D = [1, —1,0] be 
Points of RP*. Calculate the cross-ratio (ABCD). 


Solution First, we have to find real numbers a@ and £ such that the following 


vector equation holds: Note that we have not 
verified that A, B,C, D 
(3,5, 8) = a(1, 2,3) + BCL, 1,2). are collinear; but if they 
were not, the equations for 
a, B, y, 6 could not be 
solved. 


Comparing corresponding coordinates on both sides of this vector equation, 
we deduce that 


3=a+f8, 5=2a+f6 and 8=30+26. 


Solving these equations gives a = 2, B = 1. 
Next, we find real numbers y and 6 such that the vector equation 


(1, —1,0) = yd, 2,3) + 61, 1, 2) 


holds. Comparing corresponding coordinates on both sides of this vector 
equation, we deduce that 


l=y+6, -l=2y+6 and 0=3y +26. 


Solving these equations gives y = —2, 6 = 3. 
It follows from the definition of cross-ratio that 


aasen) = 2/2 -5 [== a 
afy 2 —2 3 


Problem 1 Calculate the cross-ratio (ABCD) for each of the following 
sets of collinear Points in RP”. 


(a) A=[1,—1,-1], B =[1,3, —2], C = [3,5, —5], D=[1, —5, 0] 
(b) A =[1, 2,3], B =[2, 2, 4], C =[—3, —5, —8], D = [3, —3, 0] 


Cross-Ratio 


You may have noticed that the Points A, B,C, D in Problem | (b) are the 
same as those which appear in Example |. The only difference is that different 
homogeneous coordinates are used to represent the Points in each case. As 
we mentioned after the definition of cross-ratio, the value of the cross-ratio 
(ABCD) does not depend on the homogeneous coordinates that are used to 
represent A, B,C, D, so it is not surprising that the cross-ratio turned out to 
have the value —4 in both cases. 


Theorem 1 The cross-ratio (ABCD) is independent of the homogeneous 
coordinates that are used to represent the collinear Points A, B,C, D. 


Proof Suppose that A = [a], B = [b], C = [c], D = [d], and let 
c=aa+fPb and d=ya-+ db. (1) 
Now suppose that A = [a’], B = [b’], C = [c’], D = [d’]. Then 
a=aa’, b=bb, c=cc, d=dd, 


where a,b,c,d are some non-zero real numbers. 
By substituting these expressions into the equations (1), we obtain 


ce’ =aaa'+ Bbb' and dd’ = yaa’ + bbb’, 
which we can rewrite in the form 
c =a'a'+ fb’ and d=y'a'+5'b, (2) 


where a’ = aa/c, B’ = Bb/c, y’ = ya/d, 5’ = bb/d. 
We can now check that equations (1) and (2) yield the same value for the 


cross-ratio: 
ak _ maker 
a} vy’ aa/cf ya/d 
Bb [6b 
aal ya 
_# j u 
af y 
So, as expected, the cross-ratio is independent of the choice of homogeneous 
coordinates. | 


The next problem illustrates that although the value of the cross-ratio 
(ABCD) is independent of the choice of homogeneous coordinates that are 
used to represent A, B, C, D, the value of the cross-ratio does depend on the 
order in which the Points A, B, C, D appear. 


Problem 2 _ Calculate the cross-ratios (BACD) and (ACBD) for the four 
Points used in Problem l(a). 
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When answering Problem 2 you may have noticed that (BACD) is the recip- 
rocal of the value which we obtained for (ABCD) in Problem l(a). Also, 
(ACBD) is equal to | — (ABCD). The next result shows that this is not simply 


chance! 


Note the way in which the 
Points are changed from 
their original ordering in 


Theorem 2 Let A, B,C, D be four distinct collinear Points in RP?, and the various cross-ratios. 


let (ABCD) = k. Then 
(BACD) = (ABDC) = 1/k, 
ACED) =] MEGA a1. 


We take the reciprocal 
when swapping the first or 
last pair of Points, and we 
subtract from 1 when 
swapping the inner or 
outer pair of Points. 


Proof Let a, b, c, d be any position vectors in R? in the directions of the 
Points A, B, C, D, respectively, of RP’, and let a, 6, y,6 be real numbers such 


that 


c=aa+ 6b and d=ya-+odb. 


Then, by definition of cross-ratio, the cross-ratio (ABC D) of the four Points 


A, B,C, D is the quantity 


5 
(ABCD) = el; a 
af y aé 


To determine (BACD), we interchange the roles of A and B in the evaluation 


of ABCD above; it follows that, since 


c=fb+aa and d=dd+ya, 


the cross-ratio (BACD) is the quantity 


(waco) = % [E = | 
BL 8 By ke 


To determine (ABDC), we interchange the roles of C and D in the evaluation 


of (ABCD) above; it follows that, since 
d=ya+dédb and c=aa+ fb, 


the cross-ratio (ABDC) is the quantity 


gang [2 
—yfa py k 


To evaluate (ACBD), we use the equations 
c=aa+fb and d=ya+d6b 


to express b and d in terms of a and ¢, as follows. 


This completes the first 
part of the proof. 


(3) 
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From the first equation in (3) we have B cannot be zero, for if it 
were then we would have 
b = (c— aa)/B c = aa; this cannot happen 
since A and C are distinct 
= (—a/B)a + (1/B)e. Cee 
If we then substitute this expression for b into the second equation in (3), 
we obtain 


d= ya+6((—a/B)a+ (1/B)e) 


= ((By — a5)/f)at (6/A)e. (5) 
It follows from the coefficients of a and ¢ in equations (4) and (5) that 
(ACBD) = 1/6 6/B 


—a/B/ (By — 06)/B 


Finally, we can use the previous parts of the proof to evaluate (DBCA), as This avoids the algebra 
follows: involved in expressing c 
and a in terms of d and b. 


(DBCA) = 1/(BDCA) (swap first two Points) 
= (BDAC) (swap last two Points) 
= | — (BADC) (swap middle two Points) 
= 1-—1/(ABDC) (swap first two Points) 
= | — (ABCD) (swap last two Points) 
=1-k. a 


Earlier, we showed that the cross-ratio (A BC D) of the four collinear Points Example 1 
A=[1, 2,3], B=[I, 1, 2], C =[3, 5, 8], D =[1, -1, 0] in RP? is —}. 
Theorem 2 enables us to deduce that 


(BACD)=—3, (ABDC) =—3, 
(ACBD) = 5, (DBCA) = 3. 


Problem 3 Let the Points A = [1,—1,—1], B = [1,3,—2], C = 
[3,5,-5], D = [1,—5,0] be collinear Points of RP?. By applying 
Theorem 2 to the solution of Problem l(a), determine the values of the 
cross-ratios (ABDC), (DBCA) and (ACBD). 


The next theorem confirms that cross-ratio is preserved by projective 
transformations. 
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Theorem 3 Let ¢ be a projective transformation, and let A, B, C, D be any 
four collinear Points in RP?. If A’ = f(A), B’ = t(B),C’ = t(C), D! = 
t(D), then 


(ABCD) = (A'B’C'D’). 


Proof Let ¢ be the projective transformation t : [x] +> [Ax], where A is an 
invertible 3 x 3 matrix. If A = [a], B = [b], C = [c], D = [d], and 


a’ = Aa, b' = Ab, ec’ = Ac, d’ = Ad, 


then A’ = [a’], B’ = [b’], C’ = [c’], D’ = [d’]. 
Since A, B,C, D are collinear, we can write 


c=aa+f6b and d=ya-+odb, (6) 


(ABCD) = o/s. 
af y 


Multiplying each equation in (6) through by A, we obtain 


sO 


ce =aa'+ pb’ and d’=ya'+ 6b’, 


(A'R'C’D'y= ele. 
a/ y 


so that 


It follows that 
(A'B'C'D’) = (ABCD). Oo 
We now use Theorem 3 to prove that if four distinct Points on a Line are in 


perspective with four distinct Points on another Line, then the cross-ratios of 
the four Points on each Line are equal. 


Theorem 4 Let A, B, C, D be four distinct Points on a Line, and 
let A’, B’,C’,D’ be four distinct Points on another Line such that 
AA’, BB’, CC’, DD’ all meet at a Point U. Then 


(ABCD) = (A'B’C'D’). 


Proof By the Fundamental Theorem of Projective Geometry, there is a 
unique projective transformation t which maps B to B’,C to C’, B’ to B, 
and C’ to C. We shall show that t(A) = A’ and t(D) = D’, and hence by 
Theorem 3 it follows that (ABCD) = (A’B’C’D’). 

First observe that the composite t 0 f fixes the Points B,C, B’ and C’. By 
the Fundamental Theorem of Projective Geometry, the only projective trans- 
formation which does this is the identity transformation, so tot = i and 
t=, 
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Next observe that t maps the Line BC onto the Line B’C’, and vice versa; 
so the Point T at which BC and B’C’ intersect must be fixed by r. Also, t maps 
the Lines BB’ and CC’ onto themselves, so their Point of intersection U must 
be fixed by ¢. 

Now let X be the image of A under t. Then X lies on B’C’. We want to 
show that X = A’. 

Suppose that X 4 A’; then AX cannot pass through U so it must intersect 
BB’ at R and CC’ at S, where R, S and U are distinct Points. 

Since ¢ is self-inverse, it maps X back to A and therefore maps AX onto 
itself. But this implies that ¢ fixes the four Points R, S, J; U; so by the Funda- 
mental Theorem of Projective Geometry f must be the identity transformation. 
This is a contradiction with the hypothesis that the Lines ABCD and A’ B'C’ D’ 
are different. It follows that we must conclude that X = A’, that is, (A) = A’. 
A similar argument shows that t(D) = D’. 

Finally, it follows by Theorem 3 that (ABCD) = (A'B‘C'D’), as 
required. | 


In affine geometry, if we are given two points A and B, then the ratio 
AC/CB uniquely determines a third point C on the line AB. We now explore 
the analogous result for projective geometry, namely that if we are given 
any three collinear Points A, B,C in RP?, then the value of the cross-ratio 
(ABCD) uniquely determines a fourth Point D. 


Theorem5 Unique Fourth Point Theorem 

Let A, B,C, X,Y be collinear Points in RP? such that 
(ABCX) = (ABCY). 

‘Woven A = 


Proof Let A = [a], B = [b], C = [c], X = [x], Y = [y]. Since A, B,C, X,Y 
are collinear, it follows that there are real numbers a, 6, y, 6, A, 4 such that 


c=aa+ fb, x=ya+6b and y=dAa+ wpb. (7) 
Then 
Xr 
(aBcx) = ©” and (apcy) = 2. 
ad al 
Since (ABCX) = (ABCY), it follows that This is the hypothesis of 
the theorem. 
y_A 
5! 


so ’A = yp/d. If we substitute this value of 4 into the expression for y in 
equation (7), we obtain 


y= (ve /d)a+ wb = (u/5)(ya t+ db) = (/8)x. 


Since y is a scalar multiple of x, it follows that X = Y, as required. im 
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In Theorem 4 we showed that the cross-ratios (ABCD) and (A’B’C'D’) are 7 
equal if the Points A’, B’,C’, D’ are in perspective with the Points A, B, C, D. 
Our next result is a partial converse of this result. 


Theorem 6 Let A, B,C, D and A, E, F,G be two sets of collinear Points 
(on different Lines in RP?) such that the cross-ratios (ABCD) and (AEFG) 
are equal. Then the Lines BE, CF and DG are concurrent. 


Proof Let P be the Point at which the Lines BE and CF meet, and let X be 
the Point at which the Line PG meets the Line ABCD. Then the Points A, B, C 
and X are in perspective from P with the Points A, E, F and G, so that 


(ABCX) = (AEFG). 
Since we know that (AEFG) = (ABCD), it follows that 
(ABCX) = (ABCD). 


By Theorem 5, we must therefore have X = D. Hence the Points A, B, C, D 
and the Points A, E, F G are in perspective from P. ai) 


We can now use Theorem 6 together with the other properties of cross-ratio 
to give a second proof of Pappus’ Theorem. 


Theorem 7 Pappus’ Theorem You met this Theorem 
Let A, B and C be three Points on a Line in RP”, and let A’, B’ and C’ earlier, in Subsection 
be three Points on another Line. Let BC’ and B’C meet at P, CA’ and C’A 34.1. 

meet at Q, and AB’ and A’B meet at R. Then P, Q and R are collinear. 


Proof Let V be the Point of intersection of the two given Lines. Also let the 
Lines BA’ and AC’ meet at the Point S, and the Lines BC’ and CA’ meet at the 
Point 7. 
Now, the Points V, A’, B’, C’ are in perspective from A with the Points B, 
A’, R, S, so that 
(VA'B'C') = (BA'RS). (8) By Theorem 4 
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Similarly, the Points V, A’, B’, C’ are in perspective from C with the Points 
B,T,P,C’, so that 


(VA'B'C') = (BTPC’). (9) 
It follows from equations (8) and (9) that 
(BA'RS) = (BTPC’), 


so that by Theorem 6 the Lines A’T, RP, SC’ are concurrent. 

We may rephrase this statement as follows: the Line RP passes through the 
Point where A’T meets SC’; that is, the Line RP passes through Q. In other 
words, P, Q and R are collinear. |_| 


3.5.2 Cross-Ratio on Embedding Planes 


So far, we have calculated a given cross-ratio (ABCD) by applying the defini- 
tion of cross-ratio directly to the Points A, B,C, D. However, it is sometimes 
convenient to evaluate the cross-ratio by examining the representation of the 
Points on some embedding plane. 

Suppose that four collinear Points of RP” pierce an embedding plane z at 
the points A, B,C, D with position vectors a, b, c, d, respectively. 

According to the Section Formula, we can write ¢ and d in the form 


c=iAa+(1—A)b and d=ypa+(1—yp)b, 


where (1 — A) : A is the ratio AC : CB, and (1 — 2) : yw is the ratio AD : DB. 
Then from the definition of cross-ratio 


1-i’A /l-p 
(ABCD) = —. | —, 
A a 
SO 
(ABCD) = Oe Gaus (10) 
~ CB/ DB’ 


Example 2 In an embedding plane, the points A, B,C, D lie in order along 
a line with the distances AB,BC, CD being | unit, 3 units and 2 units, 
respectively. Determine the cross-ratios (ABCD), (BACD) and (ACBD). 


Solution Using equation (10) and the sign convention for ratios, we have 
AC [AD 4 6 10 
(ABCD) = = --)=—, 
CB/ DB 3 5 9 
BC /BD 3 3 9 
(BACD) = = Se 
CA/ DA 4 6 10 


See Appendix 2. 
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AB / AD (:) At *) I 
Acapy=—— / = {- a eo 
BC/ DC \3 2 9 


and 


Problem 4 The points A, B,C, D lie in order along a line with the 
distances AB, BC, CD being 2 units, 1 unit and 3 units, respectively. 
Determine the cross-ratios (ABCD) and (DBCA). 


Sometimes one of the Points whose cross-ratio we are trying to find turns 
out to be an ideal Point for the embedding plane. In such cases, formula 
(10) cannot be used since some of the distances in the formula will not be 
defined. 

To be specific, suppose that the Points A, B, C, D are collinear, but that A is 
an ideal Point for the embedding plane 7, as shown in the margin. As before, Pee 
we can let b, c, d be the position vectors of the points B,C, D on z, but we representing NS 
take a to be a unit vector along A. Then ae 


c=-—(CB)a+b and d=-—(DB)a+b. 


From the definition of cross-ratio, it follows that 


1 1 DB 
(ABCD) = — (11) 
=CB8/ =DB CB 


We can now obtain the corresponding formulas for the cases where B,C 
or D is an ideal Point, by applying Theorem 2. For example, if B is an ideal 


Point, then 
(ABCD) = (BACD) (swap first two terms) 
= (BADC) (swap last two terms) 
ae b tion (11) 
=a y equation , 


Problem 5 Use Theorem 2 To prove that: 


(a) (ABCD) = a if D is an ideal Point; 
(b) (ABCD) = oe if C is an ideal Point. 


We now summarize the various formulas for cross-ratio in the form of a 
strategy, as follows. 


Strategy To use an embedding plane to calculate the cross-ratio of four 
collinear Points: 


Cross-Ratio 


1. if the four Points pierce the embedding plane at A, B, C, D, then 


AC / AD 
(AB6D = 
cB / DB 


2. if one of the Points is an ideal Point for the embedding plane, then 
DB ean 
(ABCD) = — if A is ideal, 
CB 
(ABCD) = ce if B is ideal 
= yi if B is ideal, 
LED a at 8 
(ABCD) = ——~ if C is ideal, 
AD 


AC , er 
(ABCD) = — if D is ideal. 
Be 


Example 3 Determine (ABCD) for the collinear points A, B,C, D illustrated 


in the margin, where C is an ideal Point. 
Solution Since C is an ideal Point, we have 


BD 4 
(ABCD) = — = - =4. 


AD 1 


Problem 6 Determine (ABCD) for the collinear points A,B,C, D 
illustrated in the margin, where B is an ideal Point. 


3.5.3 An Application of Cross-Ratio 


Earlier, we described how projective geometry can be used to obtain two- 
dimensional representations of three-dimensional scenes. We now describe 
how cross-ratios can be used to obtain information about a three-dimensional 
scene from a two-dimensional representation of the scene. We do this in the 
context of aerial photography. 

For simplicity, consider an aerial camera that takes pictures on a flat film 
behind its lens, L, of features on a flat piece of land in front of L. Since a point 
on the ground lies on the same line through L as its image on the film, we can 
regard the process of taking a photograph as a perspectivity centred at L. 

Since collinearity is invariant under a perspectivity, the image of any line 
£ on the ground is a line on the film. Moreover, the cross-ratio of any four 
collinear points is invariant under a perspectivity, so the cross-ratio of any four 
points on £ must be equal to the cross-ratio of their images on the film. 
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Subsection 3.1.1 


Section 3.3, Theorem 2 


Theorem 3 
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perspective 


aerial image of 
camera ground 
ae 7 
ee a: 
Par = / 
aa = flat 
ane / ground 
a i 


Example4 An aerial camera photographs a car travelling along a straight road 

on flat ground towards a junction. Before the junction there are two warning 

signs at distances of 4 km and 2 km from the junction. On the film the signs 

are | cm and 3 cm from the junction, and the car is 3 cm from the junction. — Strictly speaking, the car 
How far is the car from the junction on the ground? is not in line with the two 
signposts. Consequently, 
the distances marked on 
the photograph are 
approximations measured 
along the line of the 
left-hand kerb of the road. 


car 2 km sign 
3cm 
2 km sign = 
4 km sign I v 
4km sign = 
ground aerial 
plan photograph 
i 3 om 
Solution Let A and B denote the signs, C denote the car, and D denote the tat 7 
junction, and let A’, B’,C’, D’ be their images on the film. Then 
R 
A'C’ A’D’ 
(A’B'C'D') = i 3cm 
C’B’ D’'B’ 
_ 18/7 3 
7 4/7 i 
=3 v A 
=e 
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Now let the car be n km from the junction. Then 


rn D 
AC /AD 
(gcd) = — (—— nkm 
CB/ DB sua be 
_ 4—n 4 
~ J=7 2 4km P 
_ #=s 
~ 2(2—n) 
Since (ABCD) and (A’B'C’D’) must be equal, it follows that Y A 
=n 3 
22—n) 2° 


Hence 


4-n=3(2—-n). 


and so n = 1. That is, the car is 1 km from the junction. 


1 cm ¢ 


Problem 7 An aerial camera photographs a car travelling along a 
straight road on flat ground towards a junction. Before the junction there 
are two warning signs, at distances of 2 km and 3 km from the junction. 
On the film the signs are 4 cm and 6 cm from the junction, and the car = 3kmsign g 
is 1 cm from the junction. How far is the car from the junction on the 

ground? 


6cm 


2kmsign FW 


aerial 
photograph 


If two lines that are known to be parallel on the ground appear to meet on the 
film, then the point of intersection on the film corresponds to the ideal Point 
where the ‘parallel lines meet’. We can therefore use the above technique even 
when one of the Points is ideal, for we can use the second part of the strategy 
in Subsection 3.5.2 to calculate the cross-ratio whenever one of the Points is 
ideal. 


Problem 8 An aerial camera photographs a train travelling between 
two stations along a straight track on flat ground. The stations are 50 km 
apart. When the film is inspected, the stations are 4 cm apart, the train 
is midway between the stations, and the rails appear to meet (or vanish) 
4 cm beyond the station towards which the train is travelling. How far 
has the train to travel to the next station? 
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3.6 Exercises 


Section 3.2 


1. (a) Write down numbers a, b,c and d such that 
[1.a,6]=[-$.3.4] and [e,d,2] =[3,0, 1) 


(b) Which of the following homogeneous coordinates represent the same 
Point of RP? as [4, —8, 2]? 


@U4-2 Gd [-35] ad [-3.-2.1] 
vy [-2.4,-11 ) [3-34] 


2. Determine an equation for each of the following Lines in RP?: 
(a) the Line through the Points [1, 2, 3] and [3, 0, —2]; 
(b) the Line through the Points [1, —1, —1] and [2, 1, —3]. 
3. Determine whether each of the following sets of Points are collinear: 
(a) [1,—1,0], [1,0, —1] and [2, —1, —1]; 
(b) [1, 0, 1], [0, 1, 2] and [1, 2, 3]. 
4. Determine the Point of intersection of each of the following pairs of Lines 
in RP?: 
(a) the Lines with equations x — 2y + z=Oandx —y—z=0; 
(b) the Lines with equations x + 2y + 5z = O and 3x —y+z=0. 
5. Determine the Point of RP? at which the Line through the Points [8, —1, 2] 
and [1, —2,—1] meets the Line through the Points [0, 1, —1] and [2, 3, 1]. 
6. Determine the Point of RF? at which the Line through the Points [1, 2, 2] 
and [2, 3, 3] meets the Line through the Points [0, 1, 2] and [0, 1, 3]. 


Section 3.3 
In these exercises, you may find the following list of matrices and their inverses 
useful. 
2 1 0 —2 O 1 0 3 -I1 0 3 4 
A:| -1 0 1 0 3 -2 2 0 -1 -1 3 2 
0 1 1 I =3 I 0 0 1 3. -3 3 
o i t 
1 1 1 1 1 1 2 2 5 -7 -2 
-1 2 4 4 
A -1 —2 2 —3 —-l -3 5 0 5 3. -4 -3 
1 Z 1 1 2 001 —2 3 1 


1. Determine which of the following transformations t¢ of RP” are projective 
transformations. For those that are projective transformations, write down 
a matrix associated with f. 
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(a) t: [x,y,z]  [2x, y + 3z, 1] 
(b) t: Lx, y,z] > [x,x —y4+3z,x+ y] 
(c) ¢2 [wey 2] > ey. y— ze] 


(d) t: [x,y,z] [x+y —z,y + 3z,x + 2y 4 22] 
2. Determine the images of the Points [1, 2, 3], [0, 1, 0] and [1, —1, 1] under 
the projective transformation f associated with the matrix 


2 0 1 
A=]{-1 1 0 
0 1 1 


3. Let 
ty : [x,y,z] b> [2x+y,-x+z,y+2z], 
ta: [x,y,z] [x + y,3x — z,4y — 2z] 


be projective transformations from RP? to RP?. 
(a) Write down matrices associated with each of t; and fo. 
(b) Determine formulas for fz 0 t) and f2 0 ft, : 
4. Find the image of the Line x + 2y + 3z = O under the projective 
transformation ft; defined in Exercise 3. 
5. Determine matrices for the projective transformations which map the Points 
[1, 0, 0], [0, 1, 0], [0, 0, 1] and [1, 1, 1] onto the following Points: 
(a) [—2,0, 1], [0, 1, —1], [—1,2,—1] and [—1, 1, —1]; 
(b) [0, 1, 0], [1, 0, OJ, [—1, —1, 1] and [2, 1, 1]; 
(c) [0, 1, —3], [1, 1, —1], [4, 2, 3] and [7, 4, 3]. 
6. Use the results of Exercise 5 to determine the projective transformations 
that map: 
(a) the Points 


[—2,0, 1], [0,1,—1], [-1,2,-1], [-1,1,-H 


to the Points 


[0, 1, 0], [1,0, Ol, [=1, =1, 1], [2, 1, 1], 


respectively; 
(b) the Points 
[0, 1,0], [1, 0, O], [—1, -l, 1], [2, 1, 1] 
to the Points 
[0, 1; —3], [1, iF —1], [4, 2s 3], [7,4, 3], 
respectively; 
(c) the Points 


[0, 1,=3], [1, 1,—1], [4, 2, 3], [7,4, 3] 
to the Points 


[—2,0, 1], [0,1,—1], [—-1,2,-], [-1,1,-1], 


respectively. 
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Section 3.4 


1. For which of the following configurations of Points A,B,C and D in 
RP? is there a projective transformation sending A, B, C to the triangle of 
reference and D to the unit Point? 


D 


2. Let AABC be a triangle in R, and let U be any point of R® that is not 
collinear with any two of the points A, B, C. Let the Lines BC and AU meet 
at P CA and BU meet at Q, and AB and CU meet at R. Prove that P Q, R 
cannot be collinear. 


Section 3.5 


1. For each of the following sets of Points A, B, C, D, calculate the cross-ratio 
(ABCD). 
(a) A= ([2,1,3], B = [1,2,3], C = [8,1,9], D = [4, -1, 3] 
(b) A= [2,1,1], B = [-—1,1,-—1], C = [1,2,0], D = [—1,4, —2] 
(c) A=[-1,1,1], B = [0,0,2], C = [5, —5,3], D = [-3,3,7] 

2. For the Points A,B,C,D in Exercise l(a), determine the cross-ratios 
(BACD), (BDCA) and (ADBC). 

3. For each set of collinear points A, B,C, D illustrated below, calculate the 
cross-ratio (ABCD). 


4. Calculate the cross-ratio (ABCD) for the collinear points A, B,C, D illus- 
trated below, where D is an ideal Point. 


Cc B A D 


Sheer lights, 


5. The diagram in the margin represents an aerial photograph of a straight road _ Film image 


aerial 


on flat ground. At A there is a sign “Junction | km’, at B a sign ‘Junction Photograph 


Summary of Chapter 3 


5 km’, and C is the road junction. Also, a police patrol car is at X, and a 
bridge is at Y. The distances marked on the left of the diagram are measured 
in cm from the photograph. 

Calculate the actual distances (in km) of the patrol car and the bridge 
from the junction. 


Summary of Chapter 3 


Section 3.1: Perspective 


1. 


Renaissance artists used terraced perspective and later vertical perspec- 
tive in an attempt to portray real’ scenes in a realistic way. The modern 
system of focused perspective was discovered by Brunelleschi and finally 
perfected by Leonardo da Vinci; it is well illustrated by the woodcuts of 
Albrecht Diirer. 

The family of lines joining an eye to each point of a scene meets a screen 
in front of the eye, and the image on the screen is called a cross-section (or 
section). The cross-section gives a realistic two-dimensional representation 
of the three-dimensional scene. 

For two planes z and z’ that do not pass through the origin O in R°, points 
P in x and Q in z’ are in perspective from O if there is a straight line 
through O, P and Q. 

A perspectivity from z to z’ centred at O is a function that maps a point 
P of x toa point Q of z’ whenever P and Q are in perspective from O. 
(The planes z and zr’ may lie on the same or on opposite sides of O.) 


. The domain of a perspectivity may not be the whole of z; for, if P is any 


point of z such that OP is parallel to to x’, then P cannot have an image 
inz’. 

The image of a line @ under a perspectivity is another line, possibly minus 
one point. 

Foreshortening is the effect under a perspectivity of lines of equal 

lengths at different distances from a screen corresponding to different 
lengths on the screen. 
Two parallel lines in a horizontal plane z appear to an observer to meet at 
a vanishing point on a vertical screen z’; this is the principal vanishing 
point if the lines are perpendicular to the line of intersection of z and z’, 
and a diagonal vanishing point otherwise. 

The family of vanishing points is a line, the vanishing line, that 
corresponds to the horizon line in a picture. 


. Desargues’ Theorem Let AABC and AA’ B’C’ be triangles in R? such that 


the lines AA’, BB’ and CC’ meet at a point U. Let BC and B’C’meet at P, 
CA and C’A’meet at Q, and AB and A’ B’meet at R. Then P, Q and R are 
collinear. 
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Section 3.2: The Projective Plane RP? 


1. A Point (or projective point) is a line in R? that passes through the origin 
of R?. 

The real projective plane RP° is the set of all such Points. 

2. The expression [a, b,c], in which the numbers a, b and ¢ are not all zero, 
represents the Point P of RP? which consists of the unique line in R? that 
passes through (0,0,0) and (a,b,c). We refer to [a,b,c] as homogeneous 
coordinates of P. 

If (a,b,c) has position vector v, then we often denote P by [v] and we 
say that P can be represented by v. 

It makes NO sense to write the expression [0,0,0], since not all of a, b 
and c can be zero. 

3. The homogeneous coordinates [a,b,c] and [Aa,Ab,Ac] (where A 4 0) 
represent the same Point of RE*; that is, [a,b,c] = [Aa,Ab, Ac], for any 
A #0. 

If there is no non-zero real number A such that [a, b,c] = [Aa’ Ab, Ac | 
then the homogeneous coordinates [a, b, c] and [a’ ,b',c’ | represent differ- 
ent Points of RP. 

Further, [a’, b’, 1] = [a’,b”, 1] if and only if a’ = a" and b’ = b”. 

4. A projective figure is a subset of RP’. 

5. A Line (or projective line) in RP? is a plane in R? that passes through the 
origin. Points of RP? are collinear if they lie on a Line. 

6. The general equation of a Line in R P? is ax + by + cz = 0, where a, b,c 
are real and not all zero. 

7. Collinearity Property of RE? Any two distinct Points of RP? lie on a 
unique Line. 

Strategy To determine an equation for the Line in RP? through the Points 
[d,e, f] and [g,h,k]: 


x yo z 
1. write down the equation |d e ff) =0; 
g h_ k 


2. expand the determinant in terms of the entries in its first row to obtain 
the required equation in the form ax + by + cz = 0. 
Sometimes it is possible to ’spot’ the equation of a Line through two Points 
without using the determinant. 
8. Strategy To determine whether three Points [a, b,c], [d,e, f] and [g,h, k] 
are collinear: 


abe 
1. evaluate the determinant |d e f|; 
g hi k 


2. the Points [a,b,c], [d,e, f] and [g,,k] are collinear if and only if 
this determinant is zero. 
9. The Points [1,0,0], [0,1,0], [0,0,1] are known as the triangle of reference. 
The Point [1,1,1] is called the unit Point. 


Summary of Chapter 3 


10. 


11. 


13. 


Incidence Property of RE? Any two distinct Lines in RP? intersect in a 
unique Point of RP?. 
Let a be any plane in R® that does not pass through the origin O. Then 
there is a one-one correspondence between the points of 2 and those 
Points of RP? which pierce 2. Those Points of RP? which do not pierce 
z are called ideal Points for zz. 

The set of ideal Points for z is a plane through O parallel to z, called 
the ideal Line for z. 


. An embedding plane is a plane, 2, which does not pass through the 


origin, together with the set of all ideal Points for zr. 

The plane in R? with equation z = | is called the standard embedding 
plane. The mapping of RP? into the standard embedding plane is called 
the standard embedding of RP”. 

Parallelism is not a projective property. 


Section 3.3: Projective Transformations 


1. 


A projective transformation of RP? is a function t: RP* —> RP? of the 
form t : [x] + [Ax], where A is an invertible 3 x 3 matrix. We say that A 
is a matrix associated with t. The set of all projective transformations is 
denoted by P(2). 

If A is a matrix associated with ft, then so is AA for any non-zero 
number i. 


. The set of projective transformations P(2) forms a group under the oper- 


ation of composition of functions. In particular, if t; and f are projective 
transformations with associated matrices A; and Ao, respectively, then 
tof) and ft, are projective transformations with associated matrices 
A;A2 and Aj!. 

Strategy To compose two projective transformations f) and f2: 

1. write down matrices A; and A» associated with ft; and fo; 

2. calculate A; Ao; 

3. write down the composite f; 0 f2 with which A; Az is associated. 
Strategy To find the inverse of a projective transformation f: 

1. write down a matrix A associated with f; 

2. calculate A7!: 

3. write down the inverse t~! with which A~! is associated. 


. Strategy To find the image of a Line ax + by + cz = 0 under a projective 


transformation ¢ : [x] > [Ax]: 

1. write the equation of the Line in the form Lx = 0, where L is the 
matrix (abc); 

2. find a matrix B associated with r—!; 

3. write down the equation of the image as (LB) x = 0. 


. Projective geometry is the study of those properties of figures in RF? that 


are preserved by projective transformations. Collinearity and incidence are 
both projective properties. 
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5. A quadrilateral is a set of four Points A, B,C and D (no three of which 
are collinear), together with the Lines AB, BC, CD and DA. 
All quadrilaterals are projective-congruent. 

6. Strategy To find the projective transformation which maps [1,0,0] 
to [a,,a2,a3], [0,1,0] to [b1,b2,b3], [0,0,1] to [c1,c2,c3], [1,1,1] to 
[d,, do, d3], where no three of ({a1, 42,43], ([b1, bo, b3], [c1, C2, €3] and 
[d,, dz, d3] are collinear: 


aquu bv cyw 1 d\ 
1. find u, v, w for which | a2u bov cow 1} =| ad]; 
anu b3v) c3w 1 d3 


2. write down the required projective transformation in the form f: 
[x] +» [Ax], where A is any non-zero real multiple of the matrix 
aju bv cyw 
anu bov cow 
a3u. b3v c3w 

7. Fundamental Theorem of Projective Geometry Let ABCD and 
A’B'C'D’ be two quadrilaterals in RP*. Then: 

(a) there is a projective transformation t which maps A to A’, B to B’, C 
to C’, Dto D’; 
(a) the projective transformation ¢ is unique. 

8. Strategy To determine the projective transformation t which maps the 
vertices of the quadrilateral ABCD to the corresponding vertices of the 
quadrilateral A’ B/C’ D’: 

1. find the projective transformation t; which maps the triangle of refer- 
ence and unit Point to the Points A, B, C, D, respectively; 

2. find the projective transformation t2 which maps the triangle of refer- 
ence and unit Point to the Points A’, B’, C’, D’, respectively; 

3. calculate tf = fy 0 i. 

9. With any given perspectivity o we can construct an associated perspective 
transformation that is a one-one mapping of z U {the ideal Points for 7} 
onto z’ U {the ideal Points for x’}. This maps RP? onto itself. 

Every projective transformation can be expressed as the composite of three 
perspective transformations. 


Section 3.4: Using the Fundamental Theorem of Projective 
Geometry 


1. Desargues’ Theorem Let AABC and AA’ B’C’ be triangles in R? such that 
the lines AA’, BB’ and CC’ meet at a point U. Let BC and B’C’meet at P, 
CA and C’A’ meet at Q, and AB and A’B’ meet at R. Then P, Q and R are 
collinear. 

2. The Fundamental Theorem is often used to simplify proofs of results in 
projective geometry, where the properties involved are projective proper- 
ties. Generally, we do not explicitly refer to the corresponding auxiliary 
projective transformation ¢ concerned, but simply comment that “By the 


Summary of Chapter 3 


Fundamental Theorem of Projective Geometry, we may choose the four 
Points ... (no three of which are collinear) to be the triangle of reference 
and the unit Point; that is, to have homogeneous coordinates [1,0,0], [0,1,0], 
[0,0,1] and [1,1,1], respectively.” 


. Pappus’ Theorem Let A, B and C be three points on a line in R?, and 


let A’, B’ and C’ be three points on another line. Let BC’ and B’C meet at 
P,CA’ and C’A meet at Q, and AB’ and A’B meet at R. Then P, Q and R 
are collinear. 


. Any Euclidean figure in an embedding plane corresponds to a projective 


figure in RP?. It follows that a Euclidean theorem concerned with projective 
properties (such as collinearity and coincidence) holds if and only if the 
corresponding projective theorem holds. 


. The dual of a statement about the projective properties of some figure in 


RP? is the corresponding statement about RP? in which the terms ’Point’ 
and ’Line’ are interchanged, and such other changes are made that ensure 
that the sentence makes sense. 

A triangle (three non-collinear Points and the Lines joining them) is self- 
dual. 


. A hexagon in RP? consists of six Points joined by the six Lines joining 


them in turn. 

Pappus’ Theorem (rephrased) Let the vertices A, B’,C, A’, B and C’ of 
a hexagon lie alternately on two different Lines. Then the Points of inter- 
section of opposite sides B’C and BC’, CA’ and C’A, and AB’ and A’B, are 
collinear. 


. Brianchon’s Theorem (the dual of Pappus’ Theorem) Let the sides 


AB’, B'C, CA’, A'B, BC’, C’A of a hexagon pass alternately through two 
(different) Points P and Q in RP*. Then the Lines joining opposite vertices 
A and A’, B and B’,C and C’, are concurrent. 


. Converse of Desargues’ Theorem (also its dual) Let two triangles be 


such that the Points through which corresponding sides pass are collinear. 
Then the Lines through the corresponding vertices of the two triangles are 
concurrent. 


Section 3.5: Cross-Ratio 


il, 


Let A, B,C, D be four collinear Points in RP? represented by the position 
vectors a, b, c, d, and letc = aa+ Bb and d = ya-+ db. Then the 
cross-ratio of A, B,C, D is (ABCD) = Bb / 2. 

The cross-ratio (ABCD) is independent of the homogeneous coordinates 
that are used to represent the collinear Points A, B,C, D. 


. Let A, B,C, D be four distinct collinear Points in RP2, and let (ABCD) = 


k. Then (BACD) = (ABDC) = 1/k and (ACBD) = (DBCA) = 1 — k. 


. Let t be a projective transformation, and let A, B, C, D be any four collinear 


Points in RP*. If A’ = t (A), B’ = t(B), C’ = t(C), D’ = t (D), then 
(ABCD) = (A’B‘C'D’). 
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4. Let A, B,C, D be four distinct Points on a Line, and let A’, B’,C’, D’ be 
four distinct Points on another Line such that AA’, BB’, CC’, DD’ all meet at 
a Point U. Then (ABCD) = (A’B’C'D’). 

5. Unique Fourth Point Theorem Let A, B, C, X, Y be collinear Points in 
REP? such that (ABCX) = (ABCY). Then X = Y. 

6. Let A, B, C, D and A, E, F, G be two sets of collinear Points (on different 
Lines in RP?) such that the cross-ratios (ABCD) and (AEFG) are equal. 
Then the Lines BE, CF and DG are concurrent. 

7. Let four collinear Points of RP? pierce an embedding plane at the points 
A, B,C, D with position vectors a, b, c, d, respectively. Then, if we can 
write c and d in the form c = Aa+ (1 — A) b andd = a+ (1 — )b, we 


have 
1-a fi—u AC /AD 
(ABCD) = b= 
i u CB/ DB 


8. Strategy To use an embedding plane to calculate the cross-ratio of four 
collinear Points: 
1. if the four Points pierce the embedding plane at A,B,C,D, then 


(ABCD) = 8g | 43: 


CB / DB’ 

2. if one of the Points is an ideal Point for the embedding plane, then 
A a if A is ideal 
(ABCD) = = if A is ideal, 
A CA aes 
(ABCD) = DA if B is ideal, 
A tad if C is ideal 
ECD if C is ideal, 

AC... siege 
(ABCD) = — _ if Dis ideal. 
BC 


9. Cross-ratios can be used to measure distances on the ground from aerial 
photographs, since the cross-ratio of any four points on a line on the ground 
equals the cross-ratio of their images on the film. 


4 Projective Geometry: Conics 


In the previous chapter, we used projective geometry to prove Pappus’ 
Theorem. 


Pappus’ Theorem Let A, B and C be three points on a line in R*, and 
let A’, B’ and C’ be three points on another line. Let BC’ and B’C meet at 
P,CA’ and C’A meet at Q, and AB’ and A’B meet at R. Then P, Q and R 
are collinear. 


In fact, we could also show that if the six points A, B,C, A’, B’ and C’ lie not 
on two lines but on an ellipse, a parabola or a hyperbola, then the three points 
of intersection P, Q and R are still collinear. 


The similarity of these three results is remarkable! It suggests that, instead 
of being three new theorems, perhaps these results are particular instances of a 
general theorem about conics. We shall show that this is indeed the case. The 
general theorem, known as Pascal’s Theorem, is proved in this chapter along 
with many other such results about conics. 

But why should a result like Pascal’s Theorem apply to different types of 
conic? A clue is provided by our discussion of conics in Chapter 1. There we 
explained that conics are so called because they are the shapes that we obtain 
when we take plane sections through a double cone. In particular, the non- 
degenerate conics are the ellipses, parabolas and hyperbolas, and they arise 
when we slice through a double cone with a plane that does not pass through 
the cone’s vertex v (see the figure below). 


Subsection 3.4.1, 
Theorem 2 


Subsections 1.1.1 
and 1.1.5 
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This has an exciting implication for projective geometry. Given any two 
non-degenerate conics, such as the circle and the parabola shown in the mar- 
gin below, there is a perspectivity centred at v that maps one onto the other. 
It follows that any two non-degenerate conics are projective-congruent. Con- 
sequently, any result involving the projective properties of collinearity and 
concurrence that holds for ellipses, for example, necessarily holds also for 
parabolas and hyperbolas. 


We saw earlier that in affine geometry all ellipses are affine-congruent, all 
parabolas are affine-congruent, and all hyperbolas are affine-congruent. One of 
the exciting features of projective geometry is that all non-degenerate conics 
are projective-congruent. Thus there is no distinction between ellipses, parabo- 
las and hyperbolas in this geometry, so we simply call them projective conics. 

In Section 4.1 we define the surfaces in R? that are called projective 
conics, we see that between any two non-degenerate plane conics there 
is a perspectivity, and we note that (analogously) all projective conics are 
projective-congruent. 

In Section 4.2 we observe that tangency is a projective property. We intro- 
duce a compact notation due to Joachimsthal for projective conics, and use it 
to find formulas for tangents, tangent pairs and polars for projective conics. 

In Section 4.3 we introduce two standard forms for the equation of a 
projective conic, and use these to prove theorems such as Pascal’s Theorem. 

In Section 4.4 we prove that all non-degenerate projective conics are 
projective-congruent, using Linear Algebra methods. 

Finally, Section 4.5 discusses duality in the context of projective conics. 


4.1 _—_ Projective Conics 


4.1.1. What is a Projective Conic? 


In the previous chapter we described how, given any (Euclidean) figure F in Subsection 3.2.1 
an embedding plane z, we can obtain the corresponding projective figure by 
drawing in all the Points of RP” that pass through points of F. 
For example, if F is the circle {(x, y,Z): x7 4 y? — a 1} which lies in 
the embedding plane z with equation z = 1, then the corresponding projective 
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figure is a right circular cone. If a Point [x’, y’, z’] of RE” lies on this cone, then 
it pierces the embedding plane at the point (x’/z’, y’/z’, 1). Since this point lies 
on F, it follows that 


Multiplying by (z’)*, and dropping the dashes, we obtain the following 
equation for the cone: 


r+ yr=2’, (1) 


Conversely, suppose that we start with a projective figure. Then we can 
represent the projective figure in an embedding plane z. Of course, the rep- 
resentation that we obtain depends on the embedding plane that we use. For 
example, if the projective figure is the hollow cone x* + y? = z?, then the 
representation can be bounded, unbounded, or even in two ‘bits’ depending on 
the position of z. 


ideal 
0 Points 
for 
qT 
bounded unbounded two bits 


In Subsection 4.1.4 we show that these representations of the cone are, in 
fact, ellipses, parabolas and hyperbolas, respectively. Notice, however, that in 
the case of the parabola and the hyperbola, the representation is incomplete 
unless we include additional ideal Points for the embedding plane concerned. 
The parabola requires one ideal Point, and the hyperbola requires two. These 
are indicated by the thick lines in the diagram above. 

Let us now concentrate on just one embedding plane zr, and consider the 
kinds of projective figures that correspond to conics in zr. To keep the algebra 
simple, we shall choose z to be the so-called standard embedding plane 
z = |. We have already explained how to find the equation of the projective 
figure which corresponds to the circle {(x, y, z): xe 4 y? =1,z=1}inz,so 
let us now consider what happens when we find the equation of the projective 
figure which corresponds to an unbounded conic, such as the hyperbola 


{(x, y,z) 19? — 4x7 = 1z= 1}. 
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‘missing’ 
Points 


Any Point [x’, y’, z’] on the corresponding projective figure must pierce z at 
a point (x'/z’, y’/z’, 1) on the hyperbola, and so 


Since z’ is non-zero, we can multiply by (z’)* and drop the dashes, to obtain _ If z’ were zero, then 


the equivalent equation [x’, y’,z’] would not 
pierce z. 


y—4xr7=7, 2 £0. 


As in the case of the circle, this is the equation of a cone-like family of 
Points in RP? with vertex at the origin, as shown above. Notice, however, 
that because z 4 0, two Points are missing from the family. These must be 
Points of the form [x, y, 0] which satisfy the equation y* — 4x” = 0. Since this 
equation implies that y = 2x, it follows that the missing Points are [1, 2,0] 
and [1, —2, 0]. 

But should these Points really be omitted from the projective figure? After 
all, both Points are ideal Points for the standard embedding plane, and figures 
in an embedding plane often have ideal Points associated with them. In projec- 
tive geometry we take the view that the ideal Points [1, 2,0] and [1, —2, 0] are 
associated with the hyperbola y? — 4x? = 1, and hence that the corresponding 
projective figure consists of the entire cone-like family of Points [x, y, z] that 


satisfy the equation y? — 4x? = z?. 


Problem 1 Find an equation for the projective figure in RP? which 
corresponds to the parabola {(x, y,z):y=x?,z=1} in the standard 
embedding plane. Which ideal Points should be associated with the 
parabola? 


In general, any conic in the standard embedding plane can be expressed in 


the form 
{(x, y,Z): Ax? + Bry + Cy? + Fx+Gy+H=0,z= 1}. 


Since any Point [x’, y’, z’] on the corresponding projective figure must pierce 
the standard embedding plane at a point (x'/z’, y’/z’, 1) on the conic, it follows 
that 


x! 2 x! y’ y! 2 x! y’ 
a(5) +2() (=)+¢(2) +F(S)+e(4)+H=0 
Zz z va Zz va z 
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Multiplying by (z’)*, and dropping the dashes, we obtain the equivalent 
equation 


Ax? + Bxy + Cy? + Fxz+Gyz+Hz2=0, z40. 


If we drop the constraint that z # 0, then we can include those ideal 
Points for the standard embedding plane that should be associated with the 
plane conic. The corresponding projective figure (including the additional ideal 
Points) is known as a projective conic. 


Definition A projective conic in RP” is a set of Points whose homoge- 
neous coordinates satisfy a second-degree equation of the form 


Ax? + Bxy + Cy* + Fxz+ Gyz+ Hz’ =0. (2) 


For example, xy +xz-+ yz = 0 defines a projective conic, because it has the 
form of equation (2) with A = C = H = O and B = F = G = 1. However, 
x? + y* — 3y +27 = 0 does not define a projective conic, because it includes 
a linear term in y. 


Problem 2 Which of the following equations define projective 
conics? 


(a) x7 +.xy —3y? +4x —3y +727 =0 
(b) x7 ++xy+ y? + yz=0 
Oyem 

d) x?+y+22=2 


We say that a Point P lies on a projective conic, or a projective conic passes 
through a Point P, if the homogeneous coordinates of P satisfy the equation 
of the projective conic. For example, the Point [3,4,5] lies on the projective 
conic x? + y? — 2, since 32 + 42 = 52; however, x2 + a = z? does not pass 
through the Point [1, 1, 1], since 17 + 17 4 17. 


Problem 3 Which of the following statements are true? 


(a) The projective conic xy + xz + yz = O passes through the Point 


[1, 0, O]. 
(b) The Point [1, 2, 0] lies on the projective conic 2x” — y* +. xy +.xz+ 
2 
z° = 0. 
(c) The projective conic 3x” + 2y* — z* = 0 passes through the Point 
[1, 2, 3]. 


In this chapter we concentrate on those projective conics that can be 
represented by non-degenerate conics in the standard embedding plane. 


Definition A projective conic is non-degenerate if it can be represented 
by a non-degenerate conic in the standard embedding plane. 
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The additional Points have 
homogeneous coordinates 
of the form [x, y, 0] where 


Ax* + Bxy + Cy = 0. 


projective 
Zz conic 


It follows that a 
degenerate projective 
conic consists of a pair of 
Lines, a single Line, a 
Point, or the empty set 
(that is, ‘no Points’). 
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For example, the projective conic with equation 
x? 42y?— 27 =0 
is non-degenerate because it is represented in the standard embedding plane by 
the ellipse 
{(x,y,z) 1x7 +2y* =1,z= 1}. 
On the other hand, the projective conic with equation 
27x? + 30xy — 8y? + 14yz — 327 =0 
is degenerate because 
27x? + 30xy — 8y* + 14yz — 3z* = (9x — 2y + 3z)(3x + 4y — 2), 
so the projective conic intersects the standard embedding plane in the degen- 
erate conic which consists of the following pair of lines: 
{(x, y,z): 9x —2y+3=0,z= 1} 
and 
{(x, y,z):3x+4y—-—1=0,z = ]}. 
The following theorem shows that non-degenerate projective conics are 
preserved by projective transformations. 


Theorem 1 Let ¢ be a projective transformation, and let E be a non- 
degenerate projective conic. Then f(£) is a non-degenerate projective 
conic. 


Proof Let E have equation 
Ax? + Bxy + Cy* + Fxz+ Gyz+ Hz? =0. (2) 
Then any Point that lies on E has homogeneous coordinates [x, y, z] which 
satisfy equation (2). If [x’, y’, z’] is the image of [x, y, z] under ft, then under 
the inverse transformation we have [x, y, z] = t"([x', y’, z’]). It follows that if 


abe 
de ff 
g h k 


is a matrix associated with t~!, then 
x=ax’+by+cz, y=dx'+ey'+ fz, z=gx' thy +k. 

Substituting these expressions for x, y and z into equation (2), we obtain a 
second-degree equation in x’, y’ and z’. It follows that the image of E under t 
is a projective conic. 

Next we show that this image cannot be degenerate. A degenerate image 
would consist of a pair of Lines, a single Line, a Point, or the empty set (that 
is ‘no Points’). Since the projective transformation t~! maps Lines to Lines 
and Points to Points, it would map the degenerate image to another degenerate 
' maps the image back to 
the original non-degenerate projective conic E. It follows that t(£) cannot be 


projective conic. But this cannot happen since t~ 


degenerate. B 
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4.1.2 Tangents to Projective Conics 


Let E be any non-degenerate projective conic, and let @ be a Line in RP?. 
Then £ is a plane in R? which passes through O, and E is a surface in R? 
which consists of a ‘cone-like’ family of lines through the origin O. It follows 
that there are three possibilities: 

£ can meet E at a pair of Points; 


é can meet EF at a single Point; 
£ can meet E at no Points. 


two Points of intersection one Point of intersection no Points of intersection 


In the second case, @ just ‘touches’ F along a ‘Point of contact’ P. This 
suggests that in such a case we define ¢ to be the tangent to E at P. 


Definition Let E be a non-degenerate projective conic. Then a Line £ is a 
tangent to E at P if meets E at a Point P, and at no other Point. 


We can also define whether a Point lies inside or outside a projective conic. 


Definitions Let E be a non-degenerate projective conic. A Point Q lies 
inside E if every Line through Q meets E at two distinct Points. A Point R 
lies outside E if there is a Line through R that meets E at no Points. 


a Point Q 
inside E 


a Point R 
outside E 


Lines through Q 
meet E 
at two Points 


a Line through R 
that meets E 
at no Points. 
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The following theorem shows that ‘tangency’ and ‘lying inside or outside a 
projective conic’ are projective properties. 


Theorem 2 Let ¢ be a projective transformation, and let the Line € be a 
tangent to a non-degenerate projective conic E at a Point P. Then f(€) is a 
tangent to t(E£) at t(P). Also, if Q is a Point inside E, then t(Q) lies inside 
t(E); and if R is a Point outside E, then t(R) lies outside t(£). 


Proof By the definition of tangent, P is the only Point that @ and E have in 
common. Since ¢ is a one-one map of RP? onto itself, it follows that t(P) is 
the only Point that ¢(£) and t(£) have in common. In other words, t(€) is a 
tangent to t(E) at t(P). 

Also, if Q lies inside E, then any Line €’ through t(Q) must meet r(E) at 
two distinct Points, for otherwise the Line t—!(¢’) through Q would not meet 
E in two distinct Points. It follows that t(Q) lies inside t(£). 

Again, if R lies outside FE, then there is a Line @ through R that does not 
meet E. It follows that ¢(€) is a Line through f(R) that does not meet ¢(F), and 
so t(R) lies outside t(F). | 


tangent to 
plane conic 


tangent to 
projective conic 


The figure above illustrates that, in an embedding plane, tangents to projective 
conics correspond to tangents to plane conics, and vice versa, and so the two 
notions of tangency are consistent. 

Moreover, since a projective transformation preserves the property of being 
a tangent, it follows that we can use projective geometry to tackle problems 
involving tangents to plane conics. You will meet several examples of this later, 
but first we must show that all non-degenerate projective conics are projective- 
congruent. 
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4.1.3 Some Preliminaries 


In our proof that all non-degenerate projective conics are projective-congruent Theorem 5, Subsection 
we need a number of facts concerning plane conics. In order to concentrate on 4.1.4 
the key ideas in the proof then, we deal with these facts now. 
First, in our proof we shall need to calculate the eccentricity of conics which 
are symmetrical about the v-axis in the (u, v)-plane, and which have a focus 
that lies on the v-axis. We do this by using the following result. 


Theorem 3 Eccentricity Formula 


Let E be a non-degenerate plane conic with equation We use this result in 
> > Subsection 4.1.4. 
i 4p (Cor 46 Gu ap Jal = 0, 


If E has a focus on the v-axis, then the eccentricity e of E is given by the We use (u, v) as our 
formula coordinate system here 
rather than (x, y) to match 
e’=1-C. our notation in later 
discussion. 


For example, the eccentricity e of the ellipse with equation 
u? + 4u°—-Tu+4=0 
is given by e* = 1 — 5 = 7 so that e = 1/,/2. 


Proof First suppose that E is a circle. Then C = 1, and by convention e = 0, 
soe? = 1—C,as required. 

Next suppose that E is not a circle. Since the equation of E has no terms that 
involve u or uv, it follows that E is symmetrical about the v-axis. Also, since FOr)® 
E has a focus F on the v-axis, it follows, by symmetry, that the directrix which 
corresponds to F is perpendicular to the v-axis. Hence F has coordinates (0, 7) 
for some real number r, and the directrix has equation v = s, for some real 


number s. 
Let P(u,v) be an arbitrary point on the conic, and let PQ be the t=; 
perpendicular from P to the v-axis. Then, by Pythagoras’ Theorem, we have 


PF* = FQ? + OP? =(v—-r)+u’. 


ay 


Now let PD be the perpendicular from P to the directrix. Then, by the 
focus-directrix property, we have PF = e- PD =e- |v —s|, so 


e-(u _ s)° = PF’ = (v- ry 4’, 
Expanding the brackets and collecting terms, we obtain 
“+ di —- e”)v* —2r— e’s)u + (r? - e’s°) = 0) 


Comparing this with the equation of F in the statement of the theorem, we 
see that C = (1 — e”), and hence that e? = 1—C. Oo 


Problem 4 Determine the eccentricity of the hyperbola with equation 


Qu? — 6v? +5v—1=0. 
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In our work in the following subsection we shall also need an understanding 
of the relationship between conics which have the same eccentricity. So, first, 
observe that all non-degenerate conics with a given distance d between the 
focus and the directrix, and a given eccentricity e, are Euclidean-congruent to 
each other. This means that the size and shape of a non-degenerate conic are 
completely determined by the numbers e and d. 

If we now fix e and vary the size of d, then the shape of the conic remains 
the same but the size of the conic changes. The following figure shows the 
effect that an increase in d has on each type of conic. 


> OO )¢ 


double double double 

d d d 

By allowing d to vary throughout the interval (0, 00), we obtain every size of 
conic with a given eccentricity e. We can therefore obtain any non-degenerate 


conic by first choosing a conic with the correct eccentricity and then adjusting 
its size by varying d. 


4.1.4 Conics in Perspective 


We now demonstrate that, for any two given plane conics, there is a perspec- 
tivity between them. Indeed, we can draw a right circular cone such that each 
of the given plane conics arises as the intersection of the cone with a suitable 
plane. It follows that there is a projective transformation that maps any given 
projective conic onto any other, so the property of ‘being a conic’ is a pro- 
jective property. This enables us to prove quite surprising results about plane 
conics. 


Plane Sections of a Right Circular Cone 


Earlier, we asserted that we can construct all conics by taking different plane 
sections through double cones. We now justify that claim, and describe its 
relevance to projective geometry. 

First, we generate a hollow right circular cone by taking a line through the 
origin in R? at some angle ¢ to the (x, y)-plane, and rotating that line around 
the z-axis. We call the line a generator of the cone. Next, we cut the cone with 
planes (none of which passes through the origin) at various angles, and make 
the following observations illustrated below. (a) When the plane is horizontal, 


Euclidean-congruence 
was explained in 
Subsection 2.1.2. 


Recall that (0, co) denotes 
the interval {x : x > O}. 


This fact about the 
intersection was known to 
the ancient Greek 
mathematicians around 
300 BC. 


Subsection 1.1.1 


Here 0 < @ < 1/2 
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the section is a circle. (b) As the plane tilts, its curve of intersection with the 
cone looks like an ellipse. (c) When the plane becomes parallel to one of the 
generators of the cone, the curve of intersection looks like a parabola. (d) As 
the plane tilts further, it meets both portions of the cone, and the curve of 
intersection looks like a hyperbola. (e¢) When the plane is vertical we obtain the 
fattest possible hyperbola that can be obtained from this cone. The asymptotes 
of this hyperbola are parallel to a pair of lines on the cone that pass through 
the origin. 


: A i { 
(a) (b) (c) (d) (e) 


To prove that these observations are correct, we do some algebra. 

Since the generator of the cone through the first quadrant of the (y, z) -plane 
has slope tan @, this generator has equation {(x, y, vy): z = tang: y, x = O}, 
or {(x, y,Z):y=Z rh tang, x = O}. Rotating this generator around the z-axis 
gives the whole of the cone, which must therefore have equation 


22 


2 2 
x =: 3 
ry ante (3) 
Next, let a be the plane in which we are interested that cuts the cone, 
and let 6 be the angle between the (x, y)-plane and z. We shall assume for 
convenience that the x- and y-axes have been chosen so that z intersects the 
(x, y)-plane in a line parallel to the x-axis. Then if we rotate the (x, y)-plane 


about the x-axis through an angle @, it becomes parallel to z. 


rotate 
(x, y)-plane 


This rotation moves the x-, y- and z-axes into new positions, which we call 
the x’-, y’- and z’-axes. If a given point in R? has coordinates (x, y,z) and 
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generator 


ae 


This equation describes 
both the upper part and 
the lower part of the cone. 


Here 0 < 0 < 71/2. 
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(x’, y’,z’) with respect to these two sets of axes, then the connection between za 
these coordinates is given by the matrix equation 


(0, — sind, cos6) 


eV (1 0 0 V/s ce, 

is 1 (0, cos@, sin@) 
y |={|{ 0 cos@é —siné y |. (4) aot, (ele * 
z 0 sinO  cosé z! (1,0, 0) 


Notice that here the columns of the 3 x 3 matrix of the rotation comprise the 
coordinates (with respect to the initial x-, y-, z-axes) of the points | unit along 
each of the final x’-, y’-, z’-axes (as shown in the margin). 

Next, we translate the (x’, y’)-plane through some distance d parallel to the — Here d does not denote a 
z/-axis until it coincides with the plane z. This translation sends the x’-, y’-  focus-directrix distance, 
and z’-axes into new positions, which we call the u-, v- and w-axes. but an arbitrary positive 

If a given point in R* has coordinates (x’, y’,z’) and (u, v, w) with respect 
to these two sets of axes, then the coordinates are related by 


number. 


x’ u 0 
y |=] uv tt oO ft. (5) 
z! w d 


Overall, it follows from equations (4) and (5) that if a given point has coor- 
dinates (x, y, z) with respect to the x-, y- and z-axes, and coordinates (u, v, w) 
with respect to the u-, v- and w-axes, then 


x 1 0 0) Uu 0) 
y }={ 0 cos@ —sin@ v J+] 0 
Zz O sin@O~ cosd@ w d 
1 0 0 u 0 
= 0 cosOé —siné v + —dsin@ 
0 sin@  cosd@ w dcos@ 


Now, for points on the curve where z intersects the cone, we have w = 0. 
So at these points, 


x 1 0 0) u 0 
y }=|{ 0 cos@ —siné v |+{ —dsiné |, 
Zz 0 sin@  cosé 0 dcos@ 
that is, 
xX=U, 
y = vcosé —dsin6, (6) 


z=uvusind+dcosé. 


Since the points of intersection lie on the cone, their coordinates (x, y, z) must 
satisfy the equation of the cone given in equation (3). Hence, if we substitute 
for x, y and z from equations (6) into equation (3), we obtain 


(uv sin@ + dcos@)* 


2 + ny2 
6 —dsin@)* = 
u“ + (ucos sin 0) ante 
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After rearranging the terms in this equation, we obtain an equation of the form 


u?+Cv°+Gu+H =0, (7) 
where C, G and H are expressions involving 6 and @. In particular, 
2 sin? @ 2 tan? 6 
C =cos* 6 — = cos’ 9 {| | — —~— }. (8) 
tan? tan2 


Since equation (7) is a second-degree equation in u and v, the curve of intersec- 
tion is certainly a plane conic. Moreover, the conic is clearly non-degenerate 
with a focus on the v-axis. But which type of conic is it? 

First, suppose that 6 < @, so that z is less steep than the generators of the 
cone. Then, it follows from equation (8) that C > 0. Hence, if we apply the 
‘B? — 4AC test’ to equation (7), we find that B* —4AC = —4-1-C <0, and 
so the curve of intersection is an ellipse. 

It follows from equations (7) and (8) and the Eccentricity Formula that the 
eccentricity e of the ellipse is given by 

tan? 6 
PSC lc 91 =— 
tan? d 
sin? @ 
tan2 
sin? 0 
tan? 
sin? 6(1 + cot” ¢) 
sin? @ 


sin’ p’ 


=1-—cos?6+ 


sin? 6 + 


so that 
sin 0 


sing 


As @ increases from 0 to ¢,e increases from 0 to 1. Thus, by tilting z 
through a suitable angle @ in the interval [0,), we can obtain an ellipse with 
any desired eccentricity between 0 and |. If we then move z parallel to itself, 
the angle 6 remains the same, and so the eccentricity of the ellipse remains the 
same, but the size of the ellipse can be adjusted by any desired dilatation factor. 
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This test was described in 
Section 1.3. 


Theorem 3, Subsection 
4.1.3 


As @ increases, the ellipse 
becomes longer and 
thinner. 
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It follows that we can obtain ellipses of all possible eccentricities and sizes 
by choosing the intersecting plane z to be at the appropriate angle and at the 
appropriate distance from the origin. 

Next, suppose that 6 = @, so that z is parallel to a generator of the cone. 
Then, it follows from equation (8) that C = 0. Hence, if we apply the ‘B? — 
4AC test’ to equation (7), we find that B*2—4AC = —4-1-C =0, and so the 
curve of intersection is a parabola. 

As the plane moves further from the origin, the size of the parabola 
increases. It follows that we can obtain parabolas of all possible sizes by 
choosing the intersecting plane to be at the appropriate distance from the 
origin. 

Finally, suppose that 6 > @, so that zr is steeper than the generators of the 
cone. Then, it follows from equation (8) that C < 0. Hence, if we apply the 
‘B? — 4AC test’ to equation (7), we find that B* -4AC = —4-1-C > 0, and 
so the curve of intersection is a hyperbola. 

But, as we saw earlier, the eccentricity e of the hyperbola is given by 


sin @ 


sing: 


As @ increases from ¢ to 1/2, e increases from | to cosec @. 

In particular, for each given value of 0, the eccentricities of the hyperbolas 
obtained from all planes of that slope are equal. And as the plane moves further 
from the origin, the size of the hyperbolas increases. 

Thus, by tilting the plane of intersection through a suitable angle 6 in the 
interval [#, 2/2], we can obtain a hyperbola with any eccentricity in the inter- 
val [1, cosec @], and by moving the plane parallel to itself, we can adjust the 
size of the hyperbola by any desired factor. But how can we obtain a hyperbola 
with eccentricity greater than cosec #? 

Well, notice that the asymptotes of each hyperbola are parallel to the two 
lines of intersection of the cone with a plane through the origin that is parallel 
to the intersecting plane. It follows that the angle between the asymptotes of 
the hyperbola can be any angle from 0 to the angle between the lines of inter- 
section of the cone with a vertical plane through the origin. In other words, 
they can be no further apart than two opposite generators of the cone. So 
not every hyperbola can be found in every cone! Hence, in order to obtain 
any given hyperbola as a curve of intersection of a plane with a cone, we 
need to choose a ‘fat enough’ cone (that is, a cone with a sufficiently small 
angle ¢). 

This completes the proof of the following fact. 


asymptotes 


Theorem 4 Every non-degenerate plane conic can be found as the curve 
of intersection of a suitable right circular cone with a suitable plane. 


We can now use this theorem to illustrate that there is a perspectivity 
between any two non-degenerate plane conics. 


Projective Conics 


First, we illustrate why there is a perspectivity between any two ellipses E 
and E>. Choose any right circular cone with vertex at the origin O. Then it 
follows from the details in the proof of Theorem 4 that there are two planes 1 
and zz whose curves of intersection with the cone are FE; and E. The required 
perspectivity is simply the point-to-point mapping of E; onto F2 along the 
generators of the cone. 


Similarly, there is a perspectivity between any ellipse E; and any parabola 
E. For, given any right circular cone with vertex at the origin O, it follows 
from the earlier explanations that there are two planes zr; and 72 whose curves 
of intersection with the cone are E, and E>. 

In the same way, there is a perspectivity between any ellipse E; and any 
hyperbola Ey. Again, the perspectivity maps each pair of curves point-to-point 
along generators of the cone—but this time we need to choose a sufficiently 
fat cone in the first place, so that some plane intersects the cone in the 
hyperbola E>. 

In general, there is a perspectivity which maps any given non-degenerate 
plane conic onto any other given non-degenerate plane conic. It can be realized 
as a point-to-point map along the generators of a cone that is fat enough to yield 
both conics as sections through the cone. 

Next, just as there is a perspectivity between any two non-degenerate 
plane conics there is a projective transformation that maps any non-degenerate 
projective conic onto any other non-degenerate projective conic. 


Theorem5 All non-degenerate projective conics are projective-congruent. 


Three Tangents Theorem 


The correspondence between plane conics in an embedding plane and projec- 
tive conics in RP? enables us to match Euclidean theorems about plane conics 
with projective theorems about projective conics. Provided that the theorems 
are concerned exclusively with projective properties, then a Euclidean theorem 
will hold if and only if the corresponding projective theorem holds. 

Later we shall meet a whole range of applications of the projective- 
congruence of all non-degenerate projective conics. However we end this 
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Here we are assuming that 
7r2 is an embedding plane, 
and hence that its 
intersection Ey with the 
cone includes the ideal 
Point associated with the 
parabola. 


Here E> includes two 
ideal Points. 


This map is one-one and 
onto provided that the 
conics include their 
associated ideal Points. 


We postpone the proof of 
Theorem 5 to Section 4.4, 
to concentrate on our 
main story-line. 


Section 4.3 
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section with the following striking result, to give you a ‘taster’ of things 
to come! 


Theorem6 Three Tangents Theorem 

Let a non-degenerate plane conic touch the sides BC,CA and AB of a 
triangle AABC in R? at the points P, Q and R, respectively. Then AP, BO 
and CR are concurrent. 


The following figures illustrate the Three Tangents Theorem for an ellipse 
and a parabola. 


Proof The theorem concerns a non-degenerate conic, its tangents, and con- 
currency of lines. Since all of these properties are projective properties, it is 
sufficient to prove the result for any non-degenerate plane conic. For simplicity, 
therefore, we take the plane conic to be a circle. 

Since, by symmetry, the two tangents from a point to a circle are of equal 
length, it follows that AQ = AR, BP = BR and CP = CQ, so that, in particular, 


AR BP CQ _ 

RB PC QA 
By applying the converse to Ceva’s Theorem to this equation, it follows that 
the lines AP, BQ and CR are concurrent. a 


4.2 Tangents 


Many results about plane conics and projective conics involve properties of 
their tangents and polars. In this section we introduce a notation due to 
Joachimsthal that can be used to write down the equations of such tangents 
and polars. 


4.2.1 Tangents to Plane Conics 


Let P be a point on a non-degenerate plane conic EF, and let ¢ be the tangent at 
P to E. Next, let P’ be a point close to P on E, and let ¢’ be the line through 


A 
R Q 
B P c 
Subsection 2.4.2 


Such results appear in 
Section 4.3. 


Ferdinand Joachimsthal 
(1818-1861) was a 
distinguished German 
geometer, noted for his 
mature, polished 
exposition. 


Tangents 


P and P’. If we let P’ approach P along the curve E, the direction of the 
line €’ approaches the direction of the tangent €. We may phrase this rather 
loosely as follows: ‘the direction of the tangent is the limiting direction of the 
chords’. 

We begin by recalling that a non-degenerate plane conic has an equation of 
the form 


Ax? + Bxy + Cy? + Fx+Gy+H =0. 


If we denote the expression on the left-hand side of this equation by the 
symbol s, then the equation of the conic can be written very simply as 


s=0. 


Joachimsthal’s approach is to investigate the equations of tangents and 
polars to the conic by systematically attaching subscripts to the symbol s. For 
example, to check whether a point P; = (x1, y1) lies on the conic it is neces- 
sary to replace the variables x and y in s by x; and yj, respectively. This yields 
a number which we denote by the symbol 511; in other words, 


S] = Ax? + Bury; + Cy; + Fxi + Gy + H. 


If sj; = 0, then we can conclude that P lies on the conic; and if s1; 4 0, 
that P; does not lie on the conic. 

Similarly, the point P2 = (x2, y2) lies on the conic if and only if the number 
S22 defined by 


522 = Axi + Bxoyo + Cys + Fx. + Gyr + H 


is equal to zero. 

Joachimsthal’s notation can also be used to obtain a kind of ‘average’ num- 
ber when we ‘mix’ the subscripts 1 and 2 to define a number s;2 associated 
with two given points P; = (x1, y;) and Pz = (x2, y2) to be 


s2= Ax, x2 + B (22s) + Cyiy2 + F (2) 


+6(222) su. 


The reason for defining 512 in this way will become apparent shortly. Notice, 
for the moment, that the definition of s;2 is symmetrical in the sense that the 
value of sj2 is unaltered if we interchange the subscripts 1 and 2. In other 
words 512 = 521, a fact which we will use later. 

So far we have attached double subscripts to the symbol s, and this has 
always produced a number. Notice, however, that if we drop the second sub- 
script in the definition of 512, then the resulting expression is a linear expression 
in x and y, defined by 


S| =Anx+B(2EM) 4 cnysr (St) 46() an 


2 


Once again the reason for defining s; in this way will become apparent 
shortly. A clue is provided by the fact that in plane geometry a line can be 


217 


Subsection 1.2.1 


s=0 


P,\=(%,y) 


We may define a number 
$33 associated with a point 
P3 = (x3, y3) ina similar 
way. 


We could define s;3 and 
$93 in a similar way. 


Here we temporarily use 
the term /inear expression 
in x and y rather loosely, 
to mean an expression of 
the form ax + by +c, for 
some real numbers a, b 
and c. 
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defined by setting a linear expression in x and y equal to zero. It will turn 
out that the line defined by s; = O plays a particularly important part in our 
discussion of tangents. 

We may give a general summary of Joachimsthal’s notation in which we use 
the symbols i and j to stand for arbitrary subscripts, each of which can take 
the values 1, 2 or 3, as follows. 


Joachimsthal’s Notation for Plane Conics 
Let a plane conic have equation s = 0, where 


Ag Saye eGy a Gyre 


and let P} = (x1, y1), Po = (x2, y2) and P3 = (x3, y3) be points of R?. 


Then we define In general it is simpler 
Pee ay: Pala see NOT to try to remember 

S; = Axjx + (sae +Cyy+F E + GC u + H, these formulas, but to 
2 2 2 remember the pattern for 
Si = ie + Bay) -F Gye + Fx; + Gy; +H, obtaining the expressions 

Sj, Sij and Sij from s. 
and 
XiVj +XjVi Xj + Xj yi t+ Yj 
sij = Axjxj + BL 5 2" + Cyiyj + F- 5 14+G= 5 J +H, 


where i and j can each take the values 1, 2, or 3. 


The following example illustrates how to work out these expressions in 
Joachimsthal’s notation for conics. 


Example 1 Determine sj, 522, 5;2 and s; for the hyperbola with equation 
3x? — Ixy — y*+5x-—y—-4=0 

at the points P; = (3,2) and P) = (—5, —2). Hence determine whether either 

P, or P2 lies on the hyperbola. 


Solution The equation of the conic may be written in Joachimsthal’s 
notation as s = 0, where 


s = 3x*—2xy— y*+5x—y—4. 
Since here we have x; = 3, yj = 2,x2 = —5 and y2 = —2, we deduce that 
sy, = 3-37 -2-3-2-274+5-3-2-4=20, 


822 = 3+ (5) —2- (-5) + (-2) — (-2)? + 5+ (—5) — (-2) 4 = 24, 


oases) 2 ee Decif +55 


acy 
2 = - 


Tangents 


and 
3y+x-2 34+x 2+y 
=3-3-x—-2— =9 5 a a 
mt 2 Pa 2 
_ 19 if 5 
= 9 2 2 


Since sj, and s22 are both non-zero, it follows that neither of the points P; 
or P2 lies on the hyperbola. 


Problem 1 Determine 511,522,512 and s; for the plane conic with 
equation 


2x? + 3xy —y?+x+4+2y+1=0 
at the points P; = (1,0) and P2 = (2,1). 


Having introduced Joachimsthal’s notation we now turn our attention to 
finding the equations of tangents to a conic s = O. Since a tangent is a line 
which intersects the conic at two coincident points, we first describe how to 
determine the points where a given line @ meets s = 0. 

Recall that every point P on the line @ through two given points P; = 
(x1, 1) and Py = (x2, y2) divides the segment P; P2 in the ratio k : 1, for 
some real number k, and so has coordinates that may be written in the form 


kxg +x, kyo+y1 
k+1 7 k+1 


It follows that the line through the points P; and P, meets the conic with 
equation 


(s =) Ax? + Bxy+ Cy? + Fx+Gy+H =0 
at points which divide the segment P; P2 in the ratio k : 1, where 
k : k k k : 
A x2 +X] 4B X2 +X] yo2+ yi 4C y2+ yi 
k+1 k+1 k+1 k+1 
kx2 + x] kya + y1 


F G 
> k+1 + k+1 


+H =0. 


If we multiply both sides of this equation by (k + 1)? and collect the coef- 
ficients of the terms involving k,k and the terms independent of k, it turns 
out that we can rewrite this equation in terms of Joachimsthal’s notation in the 
marvelously simple form 


syok* + 2si2k + 511 = 0. 


This equation occurs so frequently in our work that we give it a special 
name, Joachimsthal’s Section Equation. 
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Section Formula, 
Appendix 2 


We omit the unedifying 
details. 


220 4: Projective Geometry: Conics 


Since Joachimsthal’s Section Equation is a quadratic equation in k, the line 
through P; and Pz meet the conic at two distinct points, at one repeated point, 
or not at all depending on whether the quadratic equation has two distinct real 
roots, one repeated real root or no real roots, respectively. 


Py Py Py 


Two distinct real roots One repeated real root No real roots 


Example 2 Determine the ratios in which the hyperbola with equation 
3x? — Ixy — y?+5x-y—4=0 
divides the line segment from P; = (3,2) to Py = (—5, —2). 
Solution First observe that the hyperbola and the points P; and P2 are the 
same as those used in Example 1, so we can use the values 51; = 20, s22 = 24 


and s}2 = —34 calculated there. It follows that we can rewrite Joachimsthal’s 
Section Equation in this case as 


24k? — 68k + 20 = 0, 
or 
6k? — 17k +5 =0, 
so that 
(3k — 1)(2k — 5) =0. 


Thus k = ; ork = 3. Thus, the hyperbola divides the line segment P; P2 at 
two distinct points, in the ratios 5 : 1 and 3 : 1; that is, in the ratios 1:3 and 
5:2, respectively. 


Problem 2 Determine the ratios in which the hyperbola with equation 
2x? + 3xy —y?+x4+2y+1=0 
divides the line segment from P; = (1,0) to Po = (2, 1). 
Hint: _ Use your results from Problem 1. cca 
We are now in a position to find the equation of the tangent to the plane 


conic with equation s = 0 at a point P} = (x1, y;) on the conic. To do this, let 
£ be a chord of the conic which passes through P;, and let P2 = (x2, y2) be 
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a point on € which lies outside the conic. Since the chord through P; and P 
meets the conic at two distinct points, it follows that Joachimsthal’s Section 
Equation has two distinct (real) roots k. 


Now let P2 move so that the chord through P; and P2 becomes a tangent 
to the conic at P;. This tangent is a line that meets the conic at the point P; 
alone, and so the corresponding Joachimsthal’s Section Equation must have a 


repeated real root. Hence it follows that we must have Recall that the condition 


2 for a quadratic equation 
(2512) = 4511522, 


(2 be lies 
or, equivalently, ax” +bx+c=0 


(s12)* = 511520. to have a repeated root is 
just b? = 4ac. 
Since P lies on the conic, we know that 51; = 0; so we must have 


$12 = 0. 


It follows from this equation that the point P2 = (x2, y2) must satisfy the Remember that s1 is 
equation s; = 0. But P) is an arbitrary point on the tangent to the conic at P,, obtained from s}7 by 


and so the tangent at P; must have the equation s; = 0. dropping the subscript 2. 


Theorem1 Let P; = (x1, y1) be a point on a non-degenerate plane conic 
E with equation s = 0. Then the equation of the tangent to E at P; is 
oy = ©. 


Example 3 Determine the equation of the tangent at P; = (1,1) to the 
hyperbola with equation 


3x? — Ixy — y*+5x—-y—-4=0. 


Solution The equation of the conic may be written in Joachimsthal’s 
notation as s = 0, where 


s = 3x" —2xy—y*?4+5x-—y—4. 
Since here we have x; = | and y; = 1, we deduce that 


l-y+x-1 l+x I1+y 
i-y+5 
2 ae 2 


sy =3-1-x 2 


5 
=3x-3y-2. 
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The equation of the tangent at (1, 1) to the hyperbola is therefore 


9x —5y —4=0. 


Problem 3 Determine the equation of the tangent to each of the 
following non-degenerate plane conics at the given point: 


(a) x? — xy +2y —7 = Oat the point (—1, 2); 
(b) 3x? + 2xy — y? +x —2y —3 = Oat the point (1, 1). 


Tangent Pair from a Point to a Conic 


Now suppose that we wish to find the equation of a tangent to a plane conic Pi 
E from a point P; outside the conic. Intuitively we would expect there to be 

two such tangents, as shown in the diagram in the margin. We now check this 
algebraically. The technique is similar to the case where P| lies on the conic, 

as we discussed above. 


. . : ; E ou 
First we consider a chord of E that passes through the given point P; and 
another point P> that is also outside the conic; then we move P) so that the 
line becomes a tangent. As before, this occurs when Joachimsthal’s Section 
Equation has a repeated real root, that is, when 
(6,,)"=S,»- Sy. Here s,; 4 0, since P; 


does not lie on E. 


Then, since the point P is an arbitrary point on the tangent, we can drop the 
subscript 2 to obtain the equation 


2 
(si) = s+ S11. 
This is a second-degree equation in x and y; it factorizes into two linear 


equations which represent the pair of tangents from P to E. 
P, J 


Theorem 2 Let P; be a point outside a non-degenerate plane conic E with 
equation s = 0. Then the equation of the tangent pair from P, to E is 


2 
(GDP =o Sill 
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Example 4 Find the equations of the tangents from the point (1, 1) to the 
ellipse with equation x? + 2y* = 1. 


Solution The equation of the ellipse may be written in Joachimsthal’s 
notation as s = 0, where 


s=x?42y*—- 1. 
Since here we have x; = | and y; = 1, we deduce that 
so=?+4+2-7?-1=2 
and 
sp=l-x4+2-1l-y-l=x+4+2y-1. 
The equation of the tangent pair is therefore That is, (sj)? = 8 - sy. 
Gy Se rs 2. 


Multiplying out both sides and rearranging terms, we can rewrite this equation 
in the form 


x? —4xy +2x+4y—-3=0, 
which we can then factorize as 
(x — 1)(x —4y +3) = 0. 


Hence the equations of the two tangents from the point (1, 1) to the ellipse 
arex —1=Oandx —4y+3=0. 


Problem 4 One of the two tangents from the point (2, 1) to the hyper- 
bola 4xy + 1 = 0 has the equation y = x — 1. Find the equation of the 
other. 


Poles and Polars 


In Subsection 1.2.1 we defined the polar of a point with respect to the unit 
circle. We now extend this definition to a general non-degenerate plane conic. 

Let Py; = (x1, y1) be a point outside a non-degenerate plane conic E with p 
equation s = 0 (in Joachimsthal’s notation), and suppose that Pz = (x2, y2) 
and P3 = (x3, y3) are the points where the tangents through P; meet E. These 
tangents have equations s2 = 0 and s3 = 0, respectively. Since P; lies on both 
tangents, it follows that 


s12 = QO and s13= 0. 


It follows from these equations that the points P2 and P3 both lie on the line 
with equation s; = 0, the so-called chord of contact or polar of P; with respect 
to the conic E. 
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Definitions Let E be a non-degenerate plane conic with equation s = 0, 
and let P) = (x1, y) be an arbitrary point of R*. Then the polar of P, with 
respect to E is the line with equation s; = 0. The point P; is called the 
pole of the line s; = 0 with respect to E. 


We have seen that if P; lies outside the conic E, then the polar s; = 0 is the 
chord which passes through the points where the tangent pair from P, touch 
E. Also, if P; is a point on the conic s = 0, then it follows from Theorem | 
that the polar s; = 0 is simply the tangent to E at P;. If P, lies inside the 
conic E, then the polar is simply a particular line outside E that corresponds 
to Py. 


Example 5 Determine the polar of P; = (2,2) with respect to the hyperbola 
E with equation 


3x” — Ixy —y*?+5x-—y—4=0. 
Solution The equation of the hyperbola E may be written in Joachimsthal’s 
notation as s = 0, where 

s = 3x? —2xy — y? 45x —y—4, 


Since here we have x; = 2 and y; = 2, we deduce that 


2-y+x-2 2+x 2+y 
ne ee eg eae oa ee ee 
5, =3 x 5 y+5 5 5 
= 5x 3y 


It follows that the equation of the polar of P; with respect to E has equation 


13 9 _ 13 
3X —5y =0, ory = Fx. 


Problem 5 Determine the polar of (1, —1) with respect to the hyper- 
bola E with equation 2x* + xy — 3y?+x—6=0. 


We are now able to state and prove a stunningly beautiful result concerning 
poles and polars. 
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This definition is 
analogous to the definition 
of polar with respect to a 
circle in Subsection 1.2.1. 


We will see in Theorem 3 
below that this definition 
for the case that P; lies 
inside E results in 
theorems about poles and 
polars where we do not 
need to worry whether a 
point in the plane lies 
outside, on or inside a 
given conic. 


You met this hyperbola EF 
previously, in Example 3. 
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Theorem3 La Hire’s Theorem 
Let E be a non-degenerate plane conic, and let p; be the polar of a point P; 
in R?. Then each point of p; has a polar which passes through P. 


Proof Let P; = (x1, y1) be a point in R2. Then, by definition, the polar p; 
of P; with respect to E has the equation s; = 0. 
Let P2 = (x2, y2) be any point on pj, so that in particular we have 


sj2 = 0. 


But from the definition of polar, we know that the polar of Pz with respect 
to E must have equation sz = 0. It then follows from the equation s;2 = 0 that 
the point P; must lie on the polar of P2 with respect to E. This completes the 
proof. | 


4.2.2 Tangents to Projective Conics 


Earlier, we defined a Line £ to be a tangent to a projective conic E if & meets E 
at precisely one Point P. We then explained the connection between tangents to 
projective conics and tangents to plane conics. If € is a tangent to a projective 
conic E, then the line £’ which represents @ in an embedding plane z is a 
tangent to the plane conic E’ which represents E in zz. 


tangent £' 
toE 


projective 
tangent £ 
toE 


Because of this correspondence, we are able to use results about figures in 
RP” to deduce results about figures in an embedding plane, and vice versa. In 
this subsection we use this correspondence to deduce Joachimsthal’s formu- 
las for projective conics from the corresponding formulas for plane conics. In 
preparation for this, we first extend the concepts of a tangent pair and a polar 
to a projective conic in RP?. 

Let E be a non-degenerate projective conic, and let P be a Point in RP” 
which lies outside E. If x is an embedding plane, then EF is represented in 
by a non-degenerate plane conic E’, and P is represented by a Euclidean point 
P’ in 7. Now, in the embedding plane z we can draw a pair of tangents 0 
and ¢5 from P’ to E’. Back in RP, the planes which pass through the origin 


225 


(X, Yo) 


s=0 


Subsection 4.1.2 


5, =0 
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™ projective 


tangent 


4 a pair from 
a\ PtoE 


le) 


Point P outside E iP 


and the lines £; and €2 are projective tangents to E. These tangents meet at 
the Point P, so they are called the projective tangent pair (or, simply, the 
tangent pair) from the Point P to the projective conic E. 

Now let Q’ and R’ be the points in the embedding plane z at which ¢; and 
£2 touch E’. Then the polar of P’ with respect to E’ is the line € through Q’ 
and R’. Back in RP’, the plane which passes through the origin and ¢ is a Line 
which we call the projective polar (or, simply, polar) of the Point P with 
respect to the projective conic E. 


polar of P 
with respect 
toE 


In order to see how Joachimsthal’s notation can be extended from R? to 
RP?, suppose that the work in Subsection 4.2.1 had all been carried out 
in the embedding plane z = |. To illustrate the ideas involved, recall that 
Joachimsthal’s equation for a plane conic is s = 0, where 


s = Ax? 4+ Bry + Cy*+ Fx+Gy+H. 


Earlier, in Subsection 4.1.1, we described how the equation for a conic in 
RP? can be obtained from the equation of the corresponding plane conic by 


Tangents 


replacing x by x/z and y by y/z, and then multiplying by z? to clear the 
fractions. The equation is still s = 0, but now 


s = Ax? + Bry + Cy* + Fxz+ Gyz+ Hz’. 


In a similar way, Joachimsthal’s expressions, such as s; = 0 for a polar, or 


ay = s- sj, for a tangent pair, still hold in RP? provided that we amend the 


expressions for s; and 51; by replacing x, y,x1, 1 by x/z, y/z,%1/Z1, Y1/Z1> 
respectively, and then multiplying by powers of z and z, to clear fractions. The 
resulting amendments to 51,511, etc., are as specified in the following notation. 


Joachimsthal’s Notation for Projective Conics 
Let a projective conic have equation s = 0, where 


s = Ax? + [E009 Se Cy? + Fxz+Gyz+ Hz’, 
and let [x1, y1,z1] and [x2, y2, z2] be Points of RP”. Then we define 
sy = Axyxt+ 5B(xiy + xy) + Cyy 
ck 5 F(x1z SF 306i) AF 5G(yiz ar WEI) ar ele, 
Si) = Ax? + Bury + Cy? + Fxizi + Gyizi + Aci, 
S12 = Axix2 + 5 B(xiy2 + x21) + Cyiy2 + 4 F (0122 + x221) 
4F 5G (122 ar nei) ar (ele. 


Example 6 Determine s,s; and s12 for the projective conic 
4x? + xy — 2y? — 8xz—2yz +427 =0 
at the Points [x1, y1,21] = [1,0, 2] and [x2, yo, z2] = [1,2, -1]. 


Solution Using Joachimsthal’s notation with A = 4, B = 1,C = —2, 
F = —8,G = —2 and H = 4, we deduce that 


s) =4x + h(y +0) -2-0-4(¢ + 2x) — OF 2y) + 4-22 
= —4x — fy +42; 

sy =4-140-0-8-2-044-4=4; 

sig =4-14+5(24+0) —0—4(-1+2) — 0+ 4) +4-2-(-1) 
=-11. 


With the changes to Joachimsthal’s notation described above, all of the for- 
mulas for polars, tangents and tangent pairs carry over from R* to RP?. We 
therefore have the following theorem. 


We omit the details. 
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Theorem 4 Leta projective conic E in RP? have equation s = 0. 


(a) If P = [x1, y1, z1] lies on E, then the tangent to E at P has equation 
Si =O, 
(b) If P = [x1, y1, z1] lies outside E, then the pair of tangents to E from P 
are given by the equation ae = 9 9 Silke 
(c) If P = [x1, y1, z1] is any Point in RE*, then the polar of P with respect In fact, if P lies inside or 


to E is the Line with equation sj = 0. on E, then we define the 
polar of P to be the Line 


with equation s; = 0. 


Example 7 The projective conic E has equation 
4x? + xy — 2y? — 8xz— 2yz+ 42" =0. 


(a) Determine the equation of the tangent to E at the Point [0, 1, 1]. 

(b) Determine the equations of the two tangents to E that pass through the 
Point [1, 0, 2]. 

(c) Determine the polar of the Point [1,0,2] with respect to E. 

Solution 


(a) Lets = 4x? + xy — 2y? — 8xz — 2yz + 42? and [x1, 1,21] = [0, 1, 1]. 
Then 


51 =0+50+x) -—2y-4O 4x) -(e+y) +42 
— — $x — 3y + 3z. 
Hence the equation of the tangent to E at [0, 1, 1] is 
—ix —3y+3z=0, 
or 
7x + 6y — 6z7 = 0. 
(b) The pair of tangents from the Point [1, 0, 2] to E are given by the equation 
se = 5-511, where, by Example 6, s; = —4x — 3y + 4z and sj; = 4. 
Thus the equation of the tangent pair is 
3 : 2 2 2 
(—4x - 3y +42) = (4x° + xy — 2y* — 8xz — 2yz + 427) - 4, 
or 
16x? + 2y* + 1627 + 12xy — 32xz — I2yz 
= 16x7 + 4xy — 8y? — 32xz—8yz+ 162”. 
After some rearrangement, this becomes 


4y? + 8xy —4yz = 0, 
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or 
y (4ty + 8x — 42) = 0. 
Thus the equations of the two tangents to the projective conic are 
y=0 and 4y+8x-—4z2=0. 


(c) The polar of the Point [1,0, 2] is given by the equation s; = 0, where, by 
Example 6, 


sy = —4x — 3y + 4z. 

Thus the equation of the polar of [1, 0,2] with respect to E is 
—4x — 3y +4z7z=0, 

or 


8x + 3y — 8z=0. 


Problem 6 The projective conic E has equation 
yr +27 + 2xy — Aye + zx = 0. 


(a) Determine the equation of the tangent to E at the Point [1, 0, 0]. 
Verify that [0, 1, —2] lies on this tangent. 

(b) Determine the equations of the two tangents to E that pass through 
the Point [0, 1, —2]. 

(c) Determine the polar of the Point [0, 1, —2] with respect to E. 


43 Theorems 


In this section we use the fact that all non-degenerate (projective) conics are 
projective-congruent, together with the Fundamental Theorem of Projective Subsection 3.3.4, 
Geometry, to prove many interesting results about projective conics. It then Theorem 4 
follows that the corresponding results hold for plane conics too. 


4.3.1 Points on Projective Conics 
Recall that the general form of the equation of a projective conic in RP” is Subsection 4.1.1 
Ax 4 Bxy + cy + Fxz+ Gyz+ Hz =0. 


Although this equation involves six arbitrary constants A, B,C, F,G and H, 
it is only their five ratios that matter. In fact, it takes exactly five Points to 
determine a projective conic. 
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Example 1 Determine the equation of the projective conic which passes 
through the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] and [1, 2, 3]. 


Solution Let the projective conic have equation 
Ax? + Bxy + Cy? + Fxz+ Gyz+ H2? =0. 


Since [1, 0, 0] lies on the projective conic, we must have A = 0. Similarly, 
since [0, 1, 0] and [0, 0, 1] lie on the projective conic, we must also have C = 0 
and H = 0. Thus the equation of the projective conic reduces to the form 


Bxy + Fxz+ Gyz =0. 


Since [1, 1, 1] and [1, 2, 3] both satisfy this equation, we deduce that 


B+F+G=0 (1) 
and 

2B +3F +6G =0. (2) 
Subtracting equation (2) from twice equation (1), we deduce that -F —4G = 0 
so that F = —4G; and subtracting equation (2) from three times equation (1), 


we deduce that B — 3G = 0 so that B = 3G. It follows that the equation of 
the projective conic must be of the form 


3Gxy — 4Gxz+ Gyz = 0, The equation of a 
projective conic is of 
or second degree, and so 
G #0. 


3xy —4xz+ yz=0. 


Problem 1 Determine the equation of the projective conic which 
passes through the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1,1,1] and 
[—2, 3, 1]. 


We now use the approach in Example | to prove the following result. 


Theorem 1 Five Points Theorem 

There is a unique non-degenerate projective conic through any given set of 
five Points, no three of which are collinear. In particular, if the five Points 
are [1,0, 0], [0, 1, 0], [0,0, 1], [1, 1, 1] and [a,b,c], then the equation of the 
conic is 


c(a — b)xy + b(c — a)xz+a(b—c)yz=0. 


Proof By the Fundamental Theorem of Projective Geometry, there is a pro- 
jective transformation ¢ which maps four of the Points to [1, 0, 0], [0, 1, 0], 
[0, 0, 1] and [1, 1, 1]. Let [a,b,c] be the image of the fifth Point under rf. 
Since t—! preserves collinearity, it follows that no three of the Points [1, 0, 0], 
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(0, 1, O], [0, O, 1], [1, 1, 1] and [a, b, c] are collinear. This observation enables 
us to deduce that the numbers a, b and c are all different and non-zero. 

For example, b 4 c, for otherwise [a,b,c], [1, 0, 0] and [1, 1, 1] would all 
lie on the Line y = z. Similarly,a A bandc #a. 

Also, c € 0, for otherwise [a,b,c], [1, 0, 0] and [0, 1, 0] would all lie on 
the Line z = 0. Similarly, a 4 0 and b 4 0. 

Since f is a one-one transformation which preserves non-degenerate projec- 
tive conics, the theorem holds if and only if there is a unique non-degenerate 
projective conic through the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1] and 
[a, b,c]. In fact, since no degenerate projective conic can pass through [1, 0, 0], 
(0, 1, 0], [0, O, 1], [1, 1, 1] and [a, b,c], it is sufficient to show that there is a 
unique projective conic (with the desired equation) through these Points. 

Now any projective conic has equation 


Ax? + Bxy + Cy? + Fxz+ Gyz + Hz* =0, 
and if it passes through the Point [1, 0, 0], then A = 0. Similarly, if it passes 
through the Points [0, 1, 0] and [0, 0, 1], then C = 0 and H = O. It follows 
that any projective conic which passes through [1, 0, 0], [0, 1, 0] and [0, 0, 1] 
must have an equation of the form 


Bxy + Fxz+Gyz=0, (3) 


for some real numbers B, F and G. 
If the projective conic also passes through the Points [1, 1, 1] and [a,b,c], 
then 
B+F+G=0 (4) 
and 
Bab+ Fac+ Gbhc = 0. (5) 


We may regard equations (4) and (5) as simultaneous equations in B and F. 
If we subtract equation (5) from ab times equation (4), we obtain 
F(ab — ac) + G(ab — bc) = 0,7 
so 
Pong". (6) 


ab —ac’ 


and if we subtract equation (5) from ac times equation (4), we obtain 
B(ac — ab) + G(ac — bc) = 0, 
so 


ac — be 
B=-G 


(7) 


ac —ab™ 
It follows from equations (3), (6) and (7) that any projective conic through 
the Points [1, 0, 0], [0, 1, O], [0, 0, 1], [1, 1, 1] and [a,b,c] must have an 


equation of the form 


ac — be ab — be 
G y xz+Gyz=0, 
ab — ac 


x 
ac —ab 
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Subsection 4.1.1, 
Theorem | 


A degenerate projective 
conic consists of a pair of 
Lines, a single Line, a 
Point, or ‘no Points’. Such 
a projective conic cannot 
pass through five Points 
without three of the Points 
being collinear. 


Note that ab —ac #0 
sincea #Oandb 4c. 


The equation of a 
projective conic is of 
second degree, so G # 0. 
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or 
c(a — b)xy + b(c — a)xz+a(b—c)yz=0. 


Since a, b and c are uniquely determined (up to a multiple) by the fifth Point, 
it follows that the projective conic is unique. | 


Now any theorem which is concerned exclusively with the projective prop- 
erties of Points, Lines and projective conics can be interpreted as a theorem 
about the corresponding points, lines and plane conics in an embedding plane. 
For example, if we are given any set of five points in an embedding plane, no 
three of which are collinear, then Theorem | tells us that there is a unique plane 
conic which passes through the points. In particular, this result remains true if 
we require that none of the five points is an ideal Point for the embedding 
plane, so we have the following result about plane conics. 


Corollary 1 There is a unique plane conic through any given set of five 
points, no three of which are collinear. 


Now consider any four distinct Points A,B,C, D, no three of which are 
collinear. If X is a Point in RP? that does not lie on any of the various 
Lines through A, B,C, D, then Theorem | tells us that there is a unique non- 
degenerate projective conic through A,B,C, D and X. If we now move X 
around RP? (avoiding the various Lines through A,B,C, D) we obtain an 
infinite family of non-degenerate projective conics through A, B,C, D. We 
therefore have the following corollary of Theorem 1. 


Corollary 2 There are infinitely many non-degenerate projective conics 
through any given set of four Points, no three of which are collinear. 


Problem 2 Find the equations of two different (non-degenerate) pro- 
jective conics through the Points [1, 0, 0], [0, 1, 0], [0, 0, 1] and [1, 2, 3]. 


Warning 
We now use this corollary to warn you about a mistake that is frequently made 
in projective geometry. 

The mistake is to assume that there exists a projective transformation t which 
maps one projective conic E; onto another projective conic E in such a way 
that four given Points on FE; are mapped to four given Points on E>. 

Of course, there is certainly a projective transformation t; which maps Fy 
onto E>, and by the Fundamental Theorem of Projective Geometry there is 
certainly a projective transformation t2 which maps the four Points on E, to 
the four Points on E>. The trouble is that t; may not be the same transformation 
as f. 
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If we multiply a, b and c 
by a constant A, then the 
equation of the projective 
conic is multiplied by 47; 
however, this does not 
change the projective 
conic. 


Some of the points may be 
ideal Points for the 
embedding plane. 


| 
A ! D 
B = 
zi | CS 
| 
I 
Warning 


Theorems 


t(A) t(D) 


E, 


1(B) 1(C) 


For example, consider two different projective conics E; and E2 through 
the Points A = [1,0,0],B = [0,1,0],C = [0,0,1],D = [1,1, 1]. (This is 
possible by Corollary 2.) By the Fundamental Theorem of Projective Geometry 
the only projective transformation which maps each of the Points A, B, C, D to 
itself is the identity transformation, and this certainly cannot map E onto Ep. 

The following theorem shows that the situation is very different if, instead 
of having to map four given Points on one projective conic to four given Points 
on another, we have to map just three. 


Theorem 2 Three Points Theorem 

Let E; and E> be non-degenerate projective conics which pass through the 
Points P;, Q;, R; and P2, Qo, R2, respectively. Then there is a projective 
transformation ¢ which maps E£ onto E> in such a way that 


t(Pi\) = Po, t(Q1)=Q2, t(Ri) = Ro. 

Proof First, let tr’ be any projective transformation that maps P;, Q1, Ri to 
[1, 0, 0], [0, 1, 0], [0, 0, 1], respectively. Then t’ maps E, to a non-degenerate 
projective conic E’ which passes through the triangle of reference. If E’ has 


equation 
Ax? + Bry + Cy? + Fxz+ Gyz+ Hz” =0, 
then the fact that the Point [1, 0, 0] lies on E’ forces A to be zero, and the fact 


that the Points [0, 1, 0] and [0, 0, 1] lie on E’ forces C and H to be zero. It 
follows that the equation of E’ can be written in the form 


Bxy + Fxz+ Gyz =0, 


for some non-zero real numbers B, F and G. By dividing by BFG we can 
rewrite the equation of E’ in the form 
xX Y x ZZ y Zz 
G F'G BF 'B 
Now define the projective transformation ¢” by t’([x, y,z]) = 
where 


= 0. 


[x xs 2 I, 


x! /G 0 0 x 
y J=| 0 IWF O y 
Zz 0 0 1/B z 


Then ¢” maps E’ to the conic with equation x’ y’ + x’z' + y’z’ = 0, or, after 
dropping the dashes, to the conic with equation xy + xz+ yz =0. 
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If B were zero, then the 
equation of the projective 
conic could be written as 


(Fx + Gy)z = 0, 


which is the equation of a 
degenerate projective 
conic consisting of two 
Lines. It follows that 

B #0. Similarly, F 4 0 
and G £0. 
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Since t” leaves the triangle of reference unchanged, it follows that the com- For example, 
posite projective transformation t; = to ¢’ maps E; to the projective conic —_¢’”([1, 0, 0]) = [1/G, 0, 0] 
with equation = [1,0 0) 
xy+xzt+yz=0 


in such a way that ¢ (P;) = [1,0, 0], t;(Q1) = [0, 1, 0] and ¢;(R,) = [0,0, 1]. 
Similarly, there is a projective transformation t2 which maps EF» onto the 
projective conic with equation 


xy+xz+yz=0 


in such a way that f2(P2) = [1,0, 0], t2(Q2) = [0, 1, 0] and t2(R2) = [0,0, 1]. 

It follows that the composite projective transformation t = ft, lon maps 
E, onto E> in such a way that t(P}) = P2,t(Q1) = Q2,t(Ri) = Ro, as 
required. a 


Example 2 The Points [1, 1, 1], [1, 2, 2], [1, 2, 1] lie on the projective conic 
E with equation 2x? + 2xy — y? + yz —5xz+2* =0. 


(a) Verify that the projective transformation ft; : [x] +> [x’] with associated 
2 -1 O 
matrix A = -l 0O 1 maps the Points [1, 1, 1], [1, 2, 2], 
0 1 -l 
[1, 2, 1] to the Points [1, 0, 0], [0, 1, 0], [0, 0,1], respectively, and maps E 
onto the projective conic E’ with equation x’y’ — x’z' + y'z' = 0. 


1 & 
Hint: The inverse of A is A7! = 12 2 
12 1 


(b) Determine the equation of the projective conic E” that is the image of E’ 
under the projective transformation fz : [x’] + [x”] with associated matrix 


1 0 0O 
B={ 0 -1 0 
0 0 1 


(c) Hence determine a matrix associated with a projective transformation that 
maps E onto the projective conic with equation xy + yz+ zx = 0. 


Solution 


(a) Let x’ = Ax, so that 


x! 2 -l 0 
y |=] -1 O 
Zz 1 - z 
Since 
-l1 0 1 1 
—1 0 1 1 = 0 |, 
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—1 1 0) 
—l 2)= 1 
1 —- 2 0) 
and 
2 -1 0 1 0) 
—l 0 1 2)=7] 0 |, 
0 1 -l 1 1 


it follows that the images under ¢ of [1, 1, 1], [1, 2, 2], [1, 2, 1] are 
[1, 0, 0], [0, 1, 0], [0, 0, 1], respectively. 
Next, x = A7!x’ so that 


4 1 11 ’ 
y J=] 1 2 2 y’ |; 
z 12 1 Zz! 
thus 
xex' ty +7, 
y =x! +2y' +22’, 
and 


z=x'+2y 42’. 


It follows that t; maps the given projective conic onto the projective conic 
with equation 


Q(x’ + yy 427)? 4200’ + yy’ + 2')(a! + 2y! 4+ 227 
— (x! + Qy! +22’)? + (x! + Qy! + 22’)! + 2y/ + 2’) 
= 5(x' ie y! +4 Zz’) (x! i 2y’ se z’) bs (x! 4 2y’ af. zy — 0, 


After some simplification, this becomes x’y’ —x’z'+ y'z’ = 0, as required. We omit the details. 


1 0 0 
(b) Under the projective transformation tz: x/t> | 0 -1 0O |x’=x’, 
0 oO ll 
we have 
x" =— x’, ” — -y and zZ” — Zz’. 
so that 
x! = x”, y’ = -y" and Zz! = Z”, 


Hence the image under fp of the projective conic x’y’ — x'z’ + y'z’ = Ois 
the projective conic with equation 


URPCLA Wey WoW 
—x"y —x'z>—yz =0, 


or 


UL Wow Wow 


xy +xez +yz =0. 
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(c) It follows from parts (a) and (b) that the projective transformation f2 0 f1, 


with matrix 
1 0 0 -1 
BA=j] 0 -I1 0O —1 0 1 
0 oO 1 1 -1 
2 -1 0 
=| -l 0 -l 
0 1 -l 
maps E onto the projective conic with equation xy + yz + zx = 0, as 
required. 


We can use a similar approach to find a projective transformation that maps 


any given projective conic onto the standard projective conic xy +yz+zx =0, We explain the 


designation ‘standard 
projective conic’ in the 


Strategy To determine a projective transformation f¢ that maps a given es PERGEEEOn: 


projective conic E onto the standard projective conic xy + yz+zx = 0: 


1. choose three Points P, QO, R on E; 

2. determine a matrix A associated with a projective transformation that 
maps P, Q, R onto [1, 0, 0], [0, 1, 0], [0, 0, 1], respectively; 

3. determine the equation Bx’y’ + Fx'z’ + Gy'z’ = 0 of t(E), for some 
real numbers B, F and G; 

4. then a matrix associated with t is BA, where 


We oO 
B={| 0 I/F 0 
OL 0/78 


Problem 3 The Points [—2, 0, 1], [0, —3, 2], [1, —2, 1] lie on the pro- 
jective conic E with equation 17x* + 47xy + 32y* + 67xz + 92yz + 
662° =0. 

(a) Verify that the projective transformation f with an associated matrix 


1 2 3 
A=] 2 3 4 ], 
3 4 6 
maps [—2,0, 1], [0, —3, 2], [1, —2, 1] to [1, 0, 0], [0, 1, 0], [0, 0,1], 
respectively. 
—2 O 1 
(b) Verify that the inverse of A is A~! = 0 3 -2 
1 -—2 1 
(c) Determine the equation of the projective conic t(E). 
(d) Hence determine a matrix associated with a projective transforma- 
tion that maps E onto the projective conic with equation xy + yz+ 
zx = 0. 


Theorems 


4.3.2 The Standard Form xy + yz + zx =0 


In the previous chapter you saw that we can often simplify problems about 
Points and Lines by mapping certain Points to the triangle of reference and the 
unit Point. In a similar way, we can often simplify problems about a projective 
conic by mapping it onto another projective conic that has a simpler equation. 

Since all non-degenerate projective conics are projective-congruent, it 
follows that there is a projective transformation which maps any given non- 
degenerate projective conic in RP* onto the projective conic with equation 
xy+yz+zx = 0. This equation turns out to be particularly useful for tackling 
a large number of problems about projective conics, so we give it a special 
name. 


Definitions The equation 
xy+yz+zx =0 


is called a standard form for the equation of a projective conic. 
The conic defined by this standard form is called a standard projective 
conic. 


The following diagram illustrates this standard projective conic together 
with its representation in the standard embedding plane z = 1. The repre- 
sentation has equation xy + y+x = 0, or (x + 1)(y+ 1) = 1, and is therefore 
a rectangular hyperbola with asymptotes x = —1 and y = —1. 


standard conic 
asymptotes xy+yz+zx=0 
rectangular 
hyperbola 


standard 
embedding 
plane z= 1 


Since this standard projective conic is defined by the equation xy + yz + 
zx = 0, it must pass through the triangle of reference [1, 0, 0], [0, 1, 0], 
[0, 0, 1]. This fact can often be used to simplify calculations involving 
projective conics. Other Points on the projective conic that appear in such 
calculations may then be expressed in terms of a single real parameter. 


Theorem3 Parametrization Theorem 
Let E be a projective conic with equation in the standard form 


2 ap We ar oe =O). 
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Subsection 3.4.1 


Subsection 4.1.4, 
Theorem 5 


The existence of such a 
projective transformation 
was also proved in 
Theorem 2 above, and the 
subsequent Strategy. 


Recall that a rectangular 
hyperbola is a hyperbola 
whose asymptotes meet at 
right angles. 


Notice the unconventional 
orientation of the axes in 
this figure. This is done to 
give a better view of the 
cone and its intersection 
with the embedding plane. 


Recall that the sides of the 
triangle of reference in 
RP? are the x-, y- and 
z-axes in R3. 
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Then each Point on E£, other than [1, 0, 0], has homogeneous coordinates 
of the form [t? + ¢,t + 1,—t], where t € R. Moreover, each such Point lies 
on E. 


Proof Let [x, y, z] be any Point on E. If x = 0, then we must have yz = 0, 
so either y = O (in which case the Point has homogeneous coordinates 
[0, 0, z] = [0,0, 1]) or z = O (in which case the Point has homogeneous coor- 
dinates [0, y, 0] = [0, 1, 0]). A similar discussion of the possibilities when y or 
z 1s zero shows that the only Points on E for which one of the homogeneous 
coordinates vanishes are the three Points [1, 0, 0], [0, 1, 0] and [0, 0,1]. 

Suppose next that [x, y, z] is a Point on E for which none of the coordinates 
vanishes, and let tf = x/y. Then x = ty and so 


(ty)y + yz+z(ty) =0, 
that is, ty? + (t + 1)yz = 0. Since y ¥ 0, it follows that 
ty+(t+ 1z=0. 

Thus y = — (#4) z, and sox = —(f + 1)z. It follows that the Point [x, y, z] 
t+1 
t 

t #0, we may rewrite these coordinates in the form [¢(t + 1),¢ + 1, —f]. 

Also, notice that we can obtain the Point [0, 1, 0] by choosing t = 0, and 
the Point [0, 0, 1] by choosing t = —1; so every Point on E, other than 
[1, 0, 0], can be written in the form [12 +1t,t+1,—t] forsomet e€ R. 


Conversely, every Point of the form [12 +1t,t+1,—t], where t € R, lies on 
E because 


has homogeneous coordinates [-« + 1)z,- ( ) Ve z|i and, since z ~ 0 and 


@+O¢+D4+¢4+ D(O+ (OC +) =0. 


This completes the proof. | 


Using a preliminary projective transformation, we can transform any prob- 
lem involving Points on a projective conic to a problem involving Points on the 
standard projective conic with equation xy + yz+zx = 0. By the Three Points 
Theorem, we can assume that three of the Points are [1, 0, 0], [0, 1, 0] and 
[0, 0,1], and by the Parametrization Theorem we may express any remaining 
Points in the form [12 +t,t-+ 1,—t] for some real number f. 

We illustrate this technique by proving Pascal’s Theorem. 


Theorem 4 __Pascal’s Theorem 

Let A, B,C, A’, B’ and C’ be six distinct Points on a non-degenerate pro- 
jective conic. Let BC and B’C intersect at P,C A’ and C’A intersect at Q, 
and AB’ and A’B intersect at R. Then P, Q and R are collinear. 


We cannot have x, y and z 
all zero for Points [x, y, z] 
in RP?. 

Notice that t 4 0 since 


x £0. 


Blaise Pascal (1623-1662) 
proved this theorem while 
still a schoolboy aged 16. 


Theorems 


Proof By the Three Points Theorem we can let the equation of the projective 
conic be in the standard form xy + yz + zx = 0, with A = [1,0,0],B = 
[0, 1,0], C = [0, 0, 1]. Also, by the Parametrization Theorem, we can let A’ = 
[a + a,a + 1,—a], B’ = [b* +b,b4+ 1,—b],C’ = [c? +ce,c + 1,—c], for 
some real numbers a, b,c. 

First we find the Point P. The Line BC’ joins the Points [0, 1,0] and 
le +c,ct+1, —c], and so clearly has equation x = — (c+ 1)z since both 
Points lie on this Line. The Line B’C joins the Points [b? +b,b+1, —b] and 
[0, 0, 1], and so clearly has equation x = by since both Points lie on this Line. 

The point P lies on both BC’ and B’C, so that its homogeneous coordinates 
[x, y, z| must satisfy the two equations x = — (c + 1) z and x = by. It follows 
that P has homogeneous coordinates [b (c + 1),c + 1, —b]. 

Similar arguments show that the Lines C A’ and C’A have equations x = ay 


and cy = — (c+ 1)z, so that their Point of intersection Q has homogeneous 
coordinates [a (c+ 1),c + 1,—c]. Also, the Lines AB’ and A’B have equa- 
tions by = — (b+ 1)z and x = — (a+ 1)z, so that their Point of intersection 


R has homogeneous coordinates [b (a+ 1),b+ 1, —b]. 
Finally, P, Q and R are collinear since 


b(ic+1) c+1 —b 
a(c+1) c+1 —-c 
bia+1) b+1 —b 
be+b c+1 —-b 
=|act+ta c+l1 -c 
ab+b b+1 —b 
bc—ab c—b 0 
=| acta c+l1 -—c | (rowl—row 3) 
ab+b b+1 —b 
c+1 -c acta —-c 
SHE al cape | abtoy 
= b(c — a)(—b +c) — (c — b)(—ab + be) 
= 0. 


This completes the proof of Pascal’s Theorem. a 
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Notice that in geometric 
illustrations of results in 
projective geometry, we 
usually draw the 
projective conic as a plane 
ellipse in R2. We draw it 
in R2 simply for 
convenience, since the 
page is part of R?! We 
draw it as an ellipse to 
avoid having to cope with 
ideal Points, etc. 


Here we use the 
determinant criterion for 
collinearity given in 
Subsection 3.2.2. 


Elementary row 
operations do not affect 
the linear independence of 
the rows. 
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By representing the configuration of Pascal’s Theorem in an embedding 
plane, we can obtain a version of the theorem that holds in R2. We state this in 
the form of a corollary, as follows. 


Corollary 3. Let A, B,C, A’, B’ and C’ be six distinct points on a non- 
degenerate plane conic, with BC’ and B’C intersecting at P,C A’ and C’A 
intersecting at Q, and AB’ and A’B intersecting at R. Then P, Q and R are 
collinear. 


Notice that because this corollary is stated as a result in R*, certain config- 
urations are excluded. For example, the lines BC’ and B’C cannot be parallel 
since P is assumed to lie in R?. Such cases have to be treated separately. 


Problem 4 Give an interpretation in R* of Pascal’s Theorem for 
which the lines AB’ and A’B are parallel (but AC’ meets A’C and BC’ 
meets B’C). 


4.3.3 Converse of Pascal's Theorem 


Pascal’s Theorem states that if six distinct Points A, B,C, A’, B’ and C’ lie on Subsection 4.3.2, 
a non-degenerate projective conic with BC’ and B’C intersecting at P,C.A’ Theorem 4 

and C’A intersecting at Q, and AB’ and A’B intersecting at R, then P,Q 

and R are collinear. The converse states that if the intersection Points P, Q 

and R are collinear, then the six Points A, B,C, A’, B’ and C’ lie on a non- 

degenerate projective conic. Again, our proof is algebraic, and depends on a 

suitable choice of coordinates. 


Theorem 5 Converse of Pascal’s Theorem 

Let A, B,C, A’, B’ and C’ be six Points, no three of which are collinear, 
with BC’ and B'C intersecting at P,C A’ and C’A intersecting at Q, and 
AB’ and A’B intersecting at R. If P, Q and R are collinear, then the Points 
A, B,C, A’, B’ and C’ lie on a non-degenerate projective conic. 


Proof Since no three of the Points A, B,C, A’, B’ and C’ are collinear, we 
can (by a preliminary projective transformation, if necessary) assume that the Here we are using the 
points A,B,C and A’ have homogeneous coordinates [1, 0, 0], [0, 1, 0], | Fundamental Theorem of 
[0, 0, 1] and [1,1,1], respectively. Suppose that B’ and C’ have homogeneous Projective Geometry. 
coordinates [a, b,c] and [r, s, t], respectively. 

By Theorem | we know that there is a unique non-degenerate projective Subsection 4.3.1 
conic through the five Points A, B,C, A’ and B’, and its equation is 


c(a — b)xy + b(c — a)xz+a(b—c)yz=0. (8) 


We must verify that the Point C’ also lies on this projective conic. We do 
this by calculating the homogeneous coordinates of the Points P,Q and R, 
and using the determinant condition that these three Points are collinear. Subsection 3.2.2 


Theorems 


First, the Line BC’ passes through the Points [0, 1, 0] and [r,s,t], and 
therefore has equation tx = rz. Similarly, the Line B’C passes through the 
Points [a,b,c] and [0, 0, 1], and therefore has equation bx = ay. It follows 
that the Point P has homogeneous coordinates [ar, br, at]. 

Next, the line C A’ passes through the Points [0, 0, 1] and [1, 1, 1], and there- 
fore has equation x = y. Similarly, the line C’A passes through the points 
[r,s,t] and [1,0,0], and therefore has equation ty = sz. It follows that the 
Point Q has homogeneous coordinates [s, 5, f]. 

Finally, the Line AB’ passes through the Points [1, 0, 0] and [a,b,c], and 
therefore has equation cy = bz. Similarly, the Line A’B passes through the 
Points [1, 1, 1] and [0, 1, 0], and therefore has equation x = z. It follows that 
the Point R has homogeneous coordinates [c, b, c]. 

Then it follows from the fact that P, QO and R are collinear that 


ar br at 
Oj} as Ss t 


Ss Ss 


b b 


= ar(sc — bt) — br(sc — ct) +. at(sb — cs). 


—br 


s ¢t 
+ at 
CG. sé 


By rearranging the terms in this equation we get 
c(a — b)rs + b(c — a)rt +a(b—c)st = 0. (9) 
By comparing equations (8) and (9), we observe that equation (8) holds for 
the Point C’ = [r,s,t]. In other words, the Point C’ lies on the protective conic 
through the Points A, B, C, A’ and B’, which has equation (8). This shows that 
A, B,C, A’, B’ and C’ lie on the same (non-degenerate) projective conic, as 
required. im 


The following corollary gives a version of the converse of Pascal’s Theorem 
that holds in R?. 


Corollary 4 Let A, B,C, A’, B’ and C’ be six points in R?, no three of 
which are collinear, with BC’ and B’C intersecting at P,C A’ and C’A 
intersecting at Q, and AB’ and A’B intersecting at R. If P, Q and R are 
collinear, then the points A, B,C, A’, B’ and C’ lie on a non-degenerate 
plane conic. 


We conclude this subsection by showing how this corollary can be used to 
construct further points on the unique plane conic which passes through five 
given points A, B, C, A’ and B’, no three of which are collinear. 

Let T be any point in R* such that AT does not pass through B,C, A’ or 
B’. Then the following construction, using only a straight edge and pencil, 
determines a sixth point, F say, where the conic meets AT. 


1. Draw the lines AB’ and A’B, and so determine the point of intersection R 
of these lines. 
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Recall that the collinearity 
of P,Q and Risa 
hypothesis of the theorem. 
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2. Draw the lines CA’ and TA, and so determine the point of intersection Q of 
these lines. 

3. Draw the lines QR and B’C, and so determine the point of intersection P 
of these lines. 

4. Then the point of intersection F of the lines BP and AT lies on the conic. 


We can see easily why this construction works. The points A, B,C, A’, B’ 
and F are such that the point R where AB’ meets A’B, the point Q where CA’ 
meets FA, and the point P where BF meets B’C are all collinear. It follows 
from the converse of Pascal’s Theorem that the points A, B,C, A’, B’ and F 
all lie on the same plane conic. 

Thus F is the point where AT meets the conic which passes through the five 
points A, B,C, A’ and B’. 


4.3.4 The Standard Form x? + y* = z* 


Since all non-degenerate projective conics are projective-congruent, there is 
a projective transformation that maps any non-degenerate projective conic in 
RP? onto the projective conic with equation x? + y? = z?. This is usually 
the simplest equation to use for problems that involve tangents and polars of 
non-degenerate projective conics. 


Definitions The equation 


P4ryu? 
is called a standard form for the equation of a projective conic. 

The conic defined by this standard form is called a standard projective 
conic. 


This standard projective conic is a right circular cone in R?, as shown in the 
margin. It meets the embedding plane z = | ina circle of unit radius. 

The reason why this standard projective conic is so useful for studying polars 
and tangents is that the equations of its polars and tangents have particularly 
simple forms. 


Theorem 6 Let E be the projective conic with equation 


2 2 2 
Gist Via — oe, 


and let P = [a, b,c] be any Point in RP”. Then: 
(a) if P € E, then ax + by — cz = (is the equation of the tangent to E 
alt [Pe 


(b) if P ¢ E, then ax + by — cz = O is the equation of the polar of P with 
respect to E. 


Subsection 4.1.4, 
Theorem 5 


This is the second 
standard form that we 
have discussed (see 
Subsection 4.3.2 for the 
other). 


Theorems 


(a) (b) 


tangent polar 


P=[a,b,c] 


ax + by—cz=0 ax + by—cz= 


P+ya? e+y=z 


Proof In Joachimsthal’s notation, the standard form becomes s = 0, where 
s = x?4+ y? — z?. So at any Point P = [x1, y1,z1] of RP*, we have s; = 
XX1 + YY — 221. 

Now recall that when P lies on the projective conic, the equation s; = 0 
gives the equation of the tangent to the projective conic at P, and when P does 
not lie on the projective conic, it gives the equation of the polar of P. The 
result follows by noting that at the Point P = [a,b,c] the equation s; = 0 
becomes ax + by — cz = 0. a 


Example 3 Determine whether each of the following Lines touches the pro- 
jective conic with equation x* + y? = z*. For each Line that does, state the 
Point of tangency. 


(a) 3x —5y+4z=0 (b) 3x —4y +5z=0 


Solution If a Line is the tangent to the projective conic x* + y* — 7? = Oat 
some Point P = [a,b,c], say, then its equation must be ax + by — cz = 0 (or 
some multiple of this). 


(a) Comparing the equations 3x — 5y + 4z = 0 and ax + by — cz = 0, we see 
that P must have homogeneous coordinates [3, —5, —4]. However, since 


(3)? + (—5)? — (—4)? = 9+ 25 -— 16 = 18 £0, 


the Point [3, —5, —4] cannot lie on the projective conic; hence the Line 
cannot be a tangent to the projective conic. 

(b) Comparing the equations 3x — 4y + 5z = 0 and ax + by — cz = 0, we see 
that P must have homogeneous coordinates [3, —4, —5]. Since 


GY + (=—4)? = (-5)7 =9 4+ 16 = 25 =90, 


the Line is a tangent to the projective conic at the Point [3, —4, —5]. 


Problem 5 Determine whether each of the following Lines touches 
the projective conic x* + y? = z?. For each Line that does, state the 
Point of tangency. 


(a) 91x — 60y — 109z = 0 (b) 4x +5y + 3z=0 


Subsection 4.2.2 
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We can use these ideas to provide an alternative proof of La Hire’s Theorem 
concerning polars. 


Theorem 7 La Hire’s Theorem 

Let E be a non-degenerate projective conic, and let P be any Point of RP’, 
with polar p with respect to E. Then the polar of any Point Q on p passes 
through P. 


E 
Q=[r,s,t] 


\ ax + by—cz=0 
rx+sy—tz=0 \ 


Proof We may assume that the equation of the projective conic is in the 
standard form x? + y* — z* = 0, and that P has homogeneous coordinates 
[a,b,c]. 

Then, by Theorem 6, the equation of p is 


ax + by —cz=0. 
So if Q = [r,s,t] is any Point on p, then 
ar+bs—ct =0. (10) 


Now, by applying Theorem 6 again, we know that the polar of Q = [r,s, t] 
with respect to the projective conic x* + y* — z* = 0 has equation 


rx +sy—tz=0, 


so, by equation (10), it passes through P. This completes the proof. | 


We now prove the following interesting result as another application of 
Theorem 6. 


Theorem 8 Three Tangents and Three Chords Theorem 

Let a non-degenerate plane conic touch the sides of a triangle at the points 
P,Q and R, respectively, and let the tangents at P,Q and R meet the 
extended chords OR, RP and PQ at the points A, B and C, respectively. 
Then A, B and C are collinear. 


We first proved a version 
of La Hire’s Theorem in 
R2, in Subsection 4.2.1. 


Theorems 


Proof Since the theorem is concerned exclusively with projective proper- 
ties, we shall prove it as a projective theorem about a projective conic. Since 
this projective conic is non-degenerate, we may assume that its equation is in 
the standard form x? + y? — z? = 0, and that the Points P, Q and R have 
homogeneous coordinates [1, 0, 1], [1,0, —1] and [0, 1, 1], respectively. 

By Theorem 6, part (a), the equation of the tangent to the projective conic at 
Pis 


x—-z=0. (1) 
Also, the equation of the chord QR is 
x y Zz 
1 O -1 |=0, 
0 1 1 
which we can rewrite in the form 
x—-y+z=0. (12) 


It follows that at the Point A, both equations (11) and (12) must hold, so that 
x = zand y = x +z = 2z. Hence A must have homogeneous coordinates 
[z, 2z, z] or, equivalently, [1, 2, 1]. 

Similar arguments show that the tangent to the projective conic at Q has 
equation x + z = 0, and the chord PR has equation x + y — z = 0, so their 
Point of intersection B is [—1,2,1], Similarly, the tangent to the projective 
conic at R has equation y — z = 0, and the chord PQ has equation y = 0, so 
their Point of intersection C is [1, 0, O]. 

Finally, A, B and C are collinear since 


f. ~ 4 
2 J = =i 9% 
=f “So 4 =1-| [-2-| +1. | 
_—o* 0 0 1 0 0 
Ss je oS 1. 0 = 


Problem 6 Let £ be the projective conic with equation xy + yz + 
zx = 0, and E) the projective conic with equation x* + y? — z7 = 0. 
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The initial triangle is 
drawn in the diagram with 
bold lines. 


The result then follows as 
an interpretation in an 
embedding plane. 


Here we are using the 
Three Points Theorem 
(Subsection 4.3.1, 
Theorem 2). 


We omit the details. 
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(a) Verify that the Points [1, 0, 0], [0, 1, 0], [0, 0, 1], [2,2,—1] and 
[2, 1,2] lie on EF). 
(b) Determine the images of the Points in part (a) under the projective 


1 -1 O 
transformation ft; with associated matrix A=] 0 O 2 
t db. 2 


(c) Use the results of parts (a) and (b) to write down a matrix associated 
with the projective transformation that maps E, onto E2. 
-1 1 -l 
(d) Hence verify that B = 1 1 -l is a matrix associated 
0 -l O 
with the projective transformation f2 that maps E> onto EF}. 


Problem 7 Using the results of Problem 3, part (b), and Problem 6, 
part (c), determine a matrix associated with a projective transformation 
that maps the projective conic with equation 


17x? + 47xy + 32y? + 67xz + 92yz + 6627 = 0 


onto the projective conic with equation x? + y* — z7 = 0. 


Parametrization of the Projective Conic x2 + y? = z” 


Sometimes it is useful to have a convenient parametrization of Points on the 
projective conic E with equation x? + y? = z?. 

First, notice that we cannot have z = 0 for Points [x, y, z] on E; for then we 
would also have x = y = 0, which is impossible. So, since the coordinates 
[x, y, z] are homogeneous coordinates, we may assume temporarily that z = 1; 
in other words, we consider the intersection of E with the embedding plane 
z=l. 

We may parametrize points of the unit circle x7 + y? = 1 in the embedding 
plane as {(cos 0, sin@, 1) : 6 € (—z,z]}. Putting tf = tan 59, for 6 € (—z,7) 
we obtain the parametrization 

ee] = [1 —7,2,14+2], teR. 
147771+22 
As @ varies over (—z, 70), 50 varies over (— 51, 57). In particular, as 50 varies 
over (—37, 51), t = tan 50 takes all values in R exactly once. Finally, when 
0 = 7, the parametrization (cos 0, sin 9, 1) gives the Point [—1, 0, 1]. 
We can summarize the above discussion as follows: 


Theorem 9 Parametrization Theorem 

Each Point of the projective conic with equation x* + y* = z?, other 
than the Point [—1,0,1], has homogeneous coordinates of the form 
[1 —27,2t,1+ 17], where t € R. 


Earlier we described a 
parametrization of Points 
on the projective conic 
with equation 

xy + yzt+2x= 0. 


We may multiply each 
coordinate by (1 + 1?) 
since they are 
homogeneous coordinates. 


Theorems 


Thus, for example, the Point P with homogeneous coordinates [1. —J/3 : 2| 


on the projective conic E with equation x? + y* = 2? also has homogeneous 
coordinates [. _ wes i]. It follows that at P we may take 


ag <i 2t J3 
—r,=- and —_ =-—. 
1+12 2 1472 2 
From the first equation we have 2(1 — t?) = 1+ f°, so that 1 = 31? or 
t = +1/,/3. Since (from the second equation displayed above) 2t ii C42") 
is negative, it follows that we must have ¢ negative. Hence, at the Point P, 


t=—1/V3. 


Problem 8 Determine the value of the parameter ¢ at the Point 


[1, 24/2, 3] in the parametrization [1 — 17,2t,1+ t?] of the projective 


conic with equation x? + y* = z?. 


This parametrization is often useful, as the following problem illustrates. 


Problem 9 Let E be the projective conic with equation x? + y* = 2”, 


and let A = [0,1,1],B = [1,0,—1],C = [0,1,—1], D = [1,0,1] and 
P=({l- 7,2t,1+ t*], where t € R, be Points on E. Let the tangents 
to E at A, B,C and D meet the tangent to E at P at the Points A’, B’, C’ 
and D’, respectively. 


(a) Determine the equations of the tangents to E at A, B,C, D and P. 

(b) Determine homogeneous coordinates for the Points A’, B’,C’ 
and D’. 

(c) Determine the value of the cross-ratio (A’ B’C’ D’). (Cross-ratio was 
defined in Subsection 3.5.1) 


4.3.5 Some General Remarks 


One of the principal features of our approach to projective geometry is the 
use of algebraic methods (via homogeneous coordinates) to prove geomet- 
ric results. This is analogous to the use of Cartesian coordinates to prove 
geometric results in Euclidean geometry. 

In projective geometry we can often use the Fundamental Theorem of Pro- 
jective Geometry, or the Three Points Theorem for projective conics, to assign 
homogeneous coordinates to particular Points in some convenient way that 
makes the algebra as simple as possible. 

Once we have made the choice of coordinates, the proofs of many geo- 
metric theorems are then simply a question of ploughing carefully through 
algebraic calculations. In a way this is a great advantage, since it is often 
much simpler to do routine algebra to obtain a proof than to sit and wait 
for geometrical inspiration! Unfortunately, the routine nature of the algebraic 
work hides the geometric ideas behind the results themselves. For example, 
when we discussed Desargues’ Theorem in the previous chapter, you probably 


BO ; 
P(l-t’, 21, 1+17] 
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gained a deeper understanding of the theorem from the geometric discussion 
in Subsection 3.1.3 than from the algebraic proof in Subsection 3.4.1. 


4.4 Applying Linear Algebra to Projective Conics 


We now prove that all non-degenerate projective conics are projective- 
congruent. In order to do this, we first describe how to express the general 
equation of a projective conic in matrix form. 

Let E be a projective conic with equation 


Age + Bxy+ Cy+ Fxz+Gyzt+ Hz’ =0, 


and let 
1 1 
A=| 48 c i6 and x=| y |, 
i 1 be 
so that x? = (x y z). It follows that 
A 3B 3F x 
x’ Ax = (x y 2Z) 5B GC 5G y 
5F 5G H z 


= (4x4 5By+ 5Fz 5 Bx +Cy+ 5Gz 
x 
pFx+ $Gy + Hz) y 
z 


= Ax*+ Bxy+ Cy? + Fxz+Gyz+ Hz. 
We may therefore write the equation of the projective conic in the form 
x’ Ax = 0. 


This suggests that we make the following definition. 


Definition Let E be a projective conic with equation 


Ax? + Bxy + Cy? + Fxz+ Gyz+ Hz’ =0. 


Then 
A 3B 35F 
=|) i 
A=| 3B C 36 


is a matrix associated with E. 


For example, a matrix associated with the projective conic with equation 


17x? + 47xy + 32y? + 67xz + 92yz + 6627 = 0 


Note that if A is a matrix 
associated with a 
projective conic E, so also 
is AA for any non-zero 
real number i. 


Applying Linear Algebra to Projective Conics 
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Problem 1 Write down a matrix associated with the projective conic 
given by each of the following equations. 


(a) x? — xy + 3y? —2xz 4 3yz— 52? =0 
(b) 2x? — y? +422 — xy + yz — 32x =0 


We may summarize the above discussion in the form of a theorem, as 
follows. 


Theorem 1 Let E be a projective conic with an associated matrix A. Then 
E has an equation of the form x’ Ax = 0. 


Having expressed the general equation of a projective conic in matrix form, 
we can prove the result we have been seeking. 


Theorem 2 All non-degenerate projective conics are projective-congruent. 


Proof It is sufficient to show that all non-degenerate projective conics are 
projective-congruent to the standard projective conic with equation x? + 
ya=z, 

Let E be any non-degenerate projective conic with equation x’ Ax = 0, 
where A is a matrix associated with E. Then by definition A is a symmetric 
matrix. 

It follows that A has three orthonormal eigenvectors v;,v2 and v3, with 
eigenvalues Aj, Az and (3, respectively. If P is the matrix whose columns are 


the coordinates of v;, V2 and v3, and in this order, then 


A, O O 
D= O rAd O 5 
0 O A 


then P is an orthogonal matrix and P? AP = D. 

Now, the transformation t; of coordinates given by x = Px’ or x’ = P’x 
transforms the projective conic with equation x’ Ax = 0 into a projective conic 
with equation (Px’ )’ A(Px’) = 0. We can rewrite this equation in the form 
(x’)7 (P7 AP)x’ = 0 or (x’) Dx’ = 0; in other words, as 


Aix’)? + daly’)? + As’)? = 0. (1) 


Since the projective conic is non-degenerate, we cannot have all the A’s 
positive or all the A’s negative, since then equation (1) describes only the origin 
in R?. 
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Recall that a matrix A is 
symmetric if AT = A. 


A matrix P is orthogonal 
if PP = Lor, 
equivalently po! =p’. 


Geometrically, this 
transformation corres- 
ponds to a rotation of the 
coordinate axes around 
the origin, keeping the 
origin fixed and the axes 
at right angles to each 
other, possibly composed 
with a reflection in one of 
the coordinate planes. 
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Also, none of the A’s in equation (1) can be zero. For example, if 43 = 0, 
then 4; and Az must be of opposite sign and equation (1) can be written as the 
equation of a pair of Lines in RP?: 


VlArlx’ = #y/lAaly’. 


The only other possibility is that two of the A’s, say A; and A2, are of the 
same sign, positive say, and the sign of the third, A3, is negative. Then equation 
(1) can be rewritten in the form 


[Ar(x’)* + [Aal(y’)? = [A3i(z’)?. 


This is the equation of a cone in R* whose axis is along the z’-axis and whose 
horizontal cross-sections are ellipses. This ‘elliptical’ cone can be mapped 
onto the ‘circular’ cone with equation (x”)* + (y”)? = (z”)* by means of 
a transformation of coordinates x’ +> x” given by x” = Bx’, where 


VlAi] 0 0 
B= 0 VJ |r2| 0 
0 0 VJ |A3| 


After dropping the dashes, it follows that we can map E onto the projective 
conic with equation x* + y? = z? by using the projective transformation t : 
[x] b> [BP’ x]. From the remark at the start of the proof, we conclude that all 
projective conics are projective-congruent. | 


Problem 2 Let E be the projective conic with equation x* — 4xy + 
2y? — 4yz + 377 =0. 


(a) Write down a matrix A associated with E. 
(b) Find an orthogonal matrix P such P’ AP is a diagonal matrix. 
(c) Find a matrix associated with a projective transformation that maps 


E onto the standard projective conic x* + y* = 2”. 


4.5 Duality and Projective Conics 


You saw earlier that in projective geometry there is a duality between Points 
and Lines. For example, the Collinearity Property states that any two Points 
lie on a unique Line, and the Incidence Property states that any two Lines 
meet in a unique Point. In general, the Principle of Duality states that any true 
statement about Points, Lines and their projective properties remains true after 
dualization. 

But why should the Principle of Duality hold? A clue is provided by La 
Hire’s Theorem. Suppose that F is any projective conic. Then with respect to 
this projective conic, every Point P in RP” can be associated with a Line D, 
namely the polar of P with respect to E. By La Hire’s Theorem, any Points 
Q and R on p have polars g and r which pass through the Point P. It follows 


The two Lines are 
coincident if 4; = 0 or 
A2 = 0. 


If necessary, we can 
re-order the eigenvalues 
and eigenvectors to ensure 
that this is the case. 


Subsection 3.4.2 
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that the association of Points with their polars changes collinearity into con- 
currence, and vice versa. This is precisely what is required for the Duality 
Principle. 


We now explore the dual of a projective conic FE. If P is any point on E, 
then its polar with respect to EF is the tangent to EF at P. The discussion above 
suggests that we should be able to dualize the projective conic E by replacing 
each Point on E by the tangent to E at that Point. If we do that, then we 
obtain a collection of Lines which forms an envelope around EF, as shown in E 
the following diagram. 


dual of P 
P 


tangents to E 


At first sight this collection of tangents appears to be rather an unwieldy 
object. Notice, however, that although we usually define a projective conic 
by specifying its family of Points, we could equally well define the projective 
conic by specifying its family of tangents. Both definitions uniquely determine 
the projective conic. When a projective conic is defined by its family of Points, 
we refer to the projective conic as a Point conic. A projective conic which is 
defined by its envelope of tangents is called a Line conic. Of course, E is the 
same projective conic however it is defined, so we say that E is self-dual. 
We have already seen that the Principle of Duality is a powerful tool for dis- _ Subsection 3.4.2 
covering ‘new theorems from old’ when the theorems involve Lines. We now 
investigate an example of its use with projective conics, by dualizing Pascal’s Subsection 4.3.2, 
Theorem. Theorem 4 


Pascal’s Theorem Let A, B,C, A’, B’ and C’ be six distinct Points on a 
non-degenerate projective conic. Let BC’ and B’C intersect at P,C A’ and 
C’A intersect at Q, and AB’ and A’B intersect at R. Then P, O and R are 
collinear. 


We proceed by making the appropriate modifications to the statement of 
Pascal’s Theorem. For clarity and convenience, we reword Pascal’s Theorem 
slightly and set out the theorem and its dual side by side. 
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Pascal’s Theorem 
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Its dual 


Let A, B,C, A’, B’ and C’ 
be six distinct Points 


on a non-degenerate projective conic. 


Let the Lines through B and C’ 
and B’ and C meet at a Point P, 
the Lines through C and A’ 
and C’ and A meet at a Point Q, 
and the Lines through A and B’ 
and A’ and B meet at a Point R. 


Then P, Q and R are collinear. 


Let a,b,c,a’,b’ and c’ 
be six distinct tangents 
to a non-degenerate projective conic. 


Let the Points of intersection of b and 
c’ and b’ and c lie on a Line p, 
the Points of intersection of c and 
a’ and c’ and a lie on a Line q, 
and the Points of intersection of a and 
b! and a’ and b lie on a Line r. 


Then p,g andr are concurrent. 


The dual result is known as Brianchon’s Theorem, after its discoverer, and 
can be reworded rather more memorably, as follows. 


Brianchon’s Theorem The diagonals joining opposite vertices of a (pro- 
jective) hexagon circumscribed around a non-degenerate projective conic 


are concurrent. 


We may interpret this result in the plane R? as follows: if we circumscribe 
a hexagon around a (non-degenerate) plane conic, then the lines joining the 


opposite vertices of the hexagon meet in a single point. 


Problem 1. Write down the result dual to the Five Points Theorem; 
namely, that ‘There is a unique non-degenerate projective conic through 


any given Set of five Points, no three of which are collinear.’ 


Many other beautiful results concerning projective and plane conics can also 


be discovered using the Principle of Duality. 


46 Exercises 


Section 4.1 


1. Find the equation for the projective conic in RP* which corresponds to each 


of the following plane conics in the standard embedding plane z = 1. 


{(.y,2): Ox? + Oxy +y* +x —-3y—-4=0, Z= i} 


{(x.y,2) 23x Oxy — 12y? — 30x 64y +1=0,2=1] 


Pascal’s Theorem can be 
worded similarly as 
follows: the opposite sides 
of a (projective) hexagon 
inscribed in a 
non-degenerate projective 
conic meet in three 
collinear Points. 


Here we are describing 
the intersection of the 
projective conic in 
Brianchon’s Theorem 
with an embedding plane. 


Subsection 4.3.1, 
Theorem | 
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In each case, which ideal Points should be associated with the plane conic? 
2. Which of the following equations define projective conics? 

(a) 2x? + 3z* — y* + xy + 4xz — 3yz = 0 

(b) x? = 4yz 

(c) x7+x-y+y+z=0 

(dd) x7+27+y?-2=0 
3. Which of the following statements are true? 

(a) The projective conic xy + xz + yz = O passes through the Point 

[1, 1, 1]. 

(b) The Point [1, —2, 1] lies on the projective conic a = 4xz. 

4. Determine the eccentricity of the ellipse E in R* with equation 


8u- + 5u* +2u —6=0. 


Section 4.2 


Exercises 1-4 concern the parabola E in R? with equation 
2 2 = 
x + 2xy+ y+ 2x —y—-—3=0. 


1. Determine the ratio in which the parabola E divides the line segment from 
(0, 0) to (1, 2). 
2. Determine the equations of the tangents to E at the points (1,—1) and 
(—3,0). 
3. One of the two tangents from (2, 1) to E has equation y = 2x—3. Determine 
the equation of the other. 
4. (a) Determine the equation of the polar of (0, 3) with respect to E. Verify 
that (2, 1) lies on this polar. 
(b) Determine the equation of the polar of (2, 1) with respect to E. Verify 
that (0, 2) lies on this polar. 
5. This question concerns the projective conic E with equation 


x? + y* — 227 + Ixy — yz + 4zx =0. 


(a) Determine the equation of the tangent to FE at the Point [0,1,—1]. 
Verify that [3, 0, 2] lies on this tangent. 

(b) Determine the equation of the other tangent to E that passes through 
the Point [3, 0, 2]. 

(c) Determine the polar of the Point [3, 0, 2] with respect to E. 


Section 4.3 

Hint for Exercises 2, 3 and 7 Take the equation of the projective conic to be 
xy+yz+zx = 0, and (by the Three Points Theorem) the Points A, B and C to 
have homogeneous coordinates [1, 0,0], [0, 1, 0] and [0, 0, 1], respectively. 


1. Determine the equation of the projective conic through the Points [1, 0, 0], 
[0, 1, OJ, [0, 0, 1], [1, —1, 1] and [4, —1, —3]. 
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. Let E be a projective conic through the vertices of a quadrilateral ABCD. 
Let AB meet CD at P, AC meet BD at Q, and AD meet BC at R. The 
triangle AP QR is called the diagonal triangle of ABCD. 
(a) Prove that the tangents to E at A and B intersect at a Point on QR. 
(b) Prove that the Line PQ is the polar of R. 
. Let E be a projective conic through the vertices of a quadrilateral ABCD, 
and let the tangents to E at A and C meet at P on the Line BD. Show that 
the tangents to E at B and D meet at a Point Q on AC. 
. Let A, B,C, D be the Points [1, 0, 0], [0, 1, 0], [2, 2, —1], [0, 0, 1], 
respectively, on the projective conic E with equation xy + yz + zx = 0. 
Let T be any other Point on E, and let TB and TC meet AD at B’ and C’, 
respectively. Determine the value of the cross-ratio (A B’C’D). 
. Let A,B,C,D and P be the Points [0, 1, 1], [0, 1, —1], [3, 4, 5], 
[5, 12, 13] and [1, 0, 1], respectively, on the projective conic E with equa- 
tion x? + y? = 2”, and let the tangents to E at A, B,C and D meet the 
tangent to E at P at the Points A’, B’, C’ and D’, respectively. 

Determine the value of the cross-ratio (A’B’C’ D’ i. 
. The Points P = [1,—1,1],Q = [1,—2,2] and R = [1, —2, 1] lie on the 
projective conic E with equation 


—2x? + 3xy + 3y? + 6xz + 6yz + 2z7 =0. 


(a) Verify that the projective transformation ¢ : [x] +> [x’], where x’ = Ax 
and A = (-1 " 1) maps P, Q and R to [1, 0, 0], [0, 1, 0] and [0, 0, 1], 
segpecueely. 

(b) Verify that the inverse of A is A7! = (. a 2 ) 

(c) Determine the equation of the projective conic t(£). 

(d) Determine a matrix associated with a projective transformation that 
maps E onto the projective conic with equation xy + yz+ zx = 0. 

(e) Hence determine a matrix associated with a projective transforma- 
tion that maps E onto the projective conic with equation x? + 
ya=z, 

. Let A and B be Points on a projective conic E, and let P be the Point in 

RE? with polar AB. The Line @ through P meets AB at Q, and E at C and 

D. Prove that (PQCD) = —-1. 


Section 4.4 


1. Let E be the projective conic with equation 


x? + Ixy + 3y? + 6xz + 2yz +27 =0. 


(a) Write down a matrix A associated with E. 
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In fact, each pair of 
tangents to E at vertices 
of ABCD intersect on its 
diagonal triangle, and 
each vertex of the 
diagonal triangle has its 
opposite side as polar. 


Cross-ratio was defined in 
Subsection 3.5.1. 


In tackling part (e), you 
should use your result 
from Problem 6, part (c), 
in Subsection 4.3.4. 


We shall use this result in 
Section 8.3. 
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(b) Find an orthogonal matrix P such P’ AP is a diagonal matrix. 


(c) Find a matrix associated with a projective transformation that maps E 


onto the standard projective conic x* + y? = z?. 
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Section 4.1: Projective conics 


1. A projective conic in RP? is a set of Points whose homogeneous coordi- 
nates satisfy a second degree equation of the form Ax” + Bxy + Cy? + 
Fxz+ Gyz+ Hz? =0. 

A projective conic is non-degenerate if it can be represented by a non- 
degenerate conic in the standard embedding plane (which has equation 
z= 1). 

A degenerate projective conic consists of a pair of Lines, a single Line, 
a Point, or ‘no Points’. 

2. Let E be a projective conic, and let 7 be an embedding plane for RP”. 
Then E£ is represented in z by a plane conic E’, where each Point of E 
either pierces z at a Point of E’ or is an ideal Point for z associated with 
E’. The number of associated ideal Points depends on the type of the conic 
E’: there are two such Points if EZ’ is a hyperbola, one if E’is a parabola, 
and none if E’ is an ellipse. 

3. Let t be a projective transformation, and let E be a non-degenerate 
projective conic. Then ¢(£) is a non-degenerate projective conic. 

4. Let E be a non-degenerate projective conic. Then a Line ¢ is a tangent to 
E at P if @ meets E at a Point P, and at no other Point. 

5. Let E be a non-degenerate projective conic. A Point Q lies inside E if 
every Line through Q meets F at two distinct Points. A Point R lies 
outside E if there is a Line through R that meets E at no Points. 

6. Let f be a projective transformation, and let the Line ¢ be a tangent to a 
non-degenerate projective conic EF at a Point P. Then r(€) is a tangent to 
t(E) at t(P). Also, if Q is a Point inside E, then t(Q) lies inside t(E); 
and if R is a Point outside EF, then t(R) lies outside E. 

Tangency and ‘lying inside or outside a projective conic’ are projec- 
tive properties. Hence we can use projective transformations to tackle 
problems involving tangents to plane conics. 

In an embedding plane tangents to projective conics correspond to 
tangents to plane conics, and vice versa. 

7. Eccentricity Formula Let EF be a non-degenerate plane conic with equa- 
tion u2 + Cv? + Gu + H = 0. If E has a focus on the v-axis, then the 
eccentricity e of E is given by the formula e” = 1 — C. 

8. Any line on the surface of a right circular cone passes through the vertex of 
the cone, and is a generator of the cone; that is, the cone can be obtained 
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by rotating that line about a fixed line (the axis of the cone) through 
the vertex. 
9. Every non-degenerate plane conic can be found as the curve of intersection 
of a suitable right circular cone with a suitable plane. 
10. Every ellipse and every parabola occurs as the intersection of any right 
circular cone and a suitable intersecting plane. 

The intersection of a right circular cone and a suitable intersecting plane 
is a hyperbola, with the property that the angle between its asymptotes is 
less than the angle between two opposite generators of the cone. Every 
hyperbola occurs as the intersection of a sufficiently ‘fat’ right circular 
cone and a suitable intersecting plane 

11. All non-degenerate projective conics are projective-congruent. 

12. Three Tangents Theorem Let a non-degenerate plane conic touch the 
sides BC, CA and AB of a triangle AABC in R? at the points P, Q and R, 
respectively. Then AP, BQ and CR are concurrent. 


Section 4.2: Tangents 
1. Joachimsthal’s notation for plane conics Let a plane conic have equation 
s = 0, where s = Ax? + Bxy+ Cy? + Fx + Gy+4+H = 0, and let 
P, = (x1, 91), P2 = (x2, y2) and P3 = (x3, y3) be points of R*. Then we 


define 
XiY + XYi xi +x p+ y 
5; = Axx + BUY 4 cyy 4 F 1 +62 Y +H, 
2 2 2 
Sig = Ax? + Bury; + Cy? + Fxi + Gyi + H, 
XiVj +X Zi x; +x; ne 
sj = Anixy + BT + Cyiy) + F i : j 462! — 1H, 


where i and j can each take the values 1, 2 or 3. 

2. Joachimsthal’s Section Equation The point ("241,42") divides the 
line segment from the point (x1, y,) to the point (x2, y2) in the ratio k:1; 
and lies on the plane conic with equation s = 0 if soak? + 2syok +511 = 0. 

3. The equation of the tangent at the point (x1, y) to the non-degenerate 
plane conic with equation s = 0 is s; = 0. 

4. The equation of the tangent pair from the point (x1, y,) to the non- 
degenerate plane conic with equation s = 0 is (s})? = s- 511. 

5. The polar (or polar line) of a point P (x), y;) with respect to a non- 
degenerate plane conic FE with equation s = 0 is the line with equation 
s, = 0. P is the pole of this line with respect to E. 

If P lies outside EF, then the polar is the line through the two points at 
which the tangents from P meet E. If P lies on E, then the polar of P is 
the tangent to E at P. 

6. La Hire’s Theorem Let E be a non-degenerate plane conic, and let pj, 
be the polar of a point P; in R*. Then each point of p; has a polar which 
passes through P). 
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7. 


10. 


Joachimsthal’s notation for projective conics Let a projective conic 
have equation s = 0, where s = Ax? + Bxy + Cy* + Fxz+Gyz+ 
H2z?, and let [x1, 91,21] and [x2, y2,z2] be Points of RP?. Then we 
define 


sy = Axyx + 5B (x1y +xy1) + Cyy 

+ $F (xz +x21) + 5G (iz t yz) + Azz, 
sy, = Axt + Bury, + Cy, + Fxizi + Gyizi + A271, 
S12 = Axix2 + 5B (x1y2 +. x2y1) + Cyiy2 

+ 5F (4122 + x2z1) + 5G (y1z2 + y2z1) + Hz122. 


The two tangents to a projective conic E that pass through a Point P out- 

side E are called the projective tangent pair (or tangent pair) from P 

to E. 

The projective polar (or polar) of a Point P outside a projective conic E 

with respect to E is the Line through the two Points at which the tangents 

from P to E meet E. 

Let a projective conic E in RP? have equation s = 0. 

(a) If P= [x1, V1, zi] lies on E, then the tangent to E at P has equation 
5, = 0. 

(b) If P= [x 11> zi] lies outside E, then the pair of tangents to FE from 
P are given by the equation se =S- Sq]. 

(c) If P = [x1, 91521] is any Point in RP’, then the polar of P with 
respect to E is the Line with equation s; = 0. 


Section 4.3: Theorems 


1. 


Five Points Theorem There is a unique non-degenerate projective conic 
through any given set of five Points, no three of which are collinear. In 
particular, if the five Points are [1,0,0], [0,1,0], [0,0, 1], [1,1,1] and 
[a,b,c], then the equation of the conic is c (a — b)xy + b(c—a)xz+ 
a(b—c)yz=0. 
There is a unique plane conic through any given set of five points, no three 
of which are collinear. 
There are infinitely many non-degenerate projective conics through any 
given set of four Points, no three of which are collinear. 
Three Points Theorem Let £; and £2 be non-degenerate projective con- 
ics which pass through the Points P;, Q1, R; and P2, Q2, Ro, respectively. 
Then there is a projective transformation ¢t which maps FE; onto £2 in such 
a way that ¢(P)) = P2,t(Q1) = Qo2,t(R1) = Ro. 

It is not always possible to map one projective conic E; onto another 
projective conic E2 in such a way that four given Points of EF, are mapped 
to four given Points of E>. 
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5. Strategy To determine a projective transformation ¢ that maps a given 

projective conic onto the projective conic xy + yz + zx =0: 

1. choose three Points P, Q, R on E; 

2. determine a matrix A associated with a projective transformation that 
maps P, Q, R onto [1, 0, 0], [0, 1, 0], [0, 0, 1], respectively; 

3. determine the equation Bx’ y’ + Fx'z’ + Gy'z’ = 0 of t (E), for some 
real numbers B, F and G; 

4. then a matrix associated with ¢t is BA, where 


ve 6 
B=| 0 I/F 0 
0 oO 1/B 


6. The equation xy + yz+zx = Ois called a standard form for the equation 
of a projective conic. The conic defined by this standard form is called a 
standard projective conic. 

7. Parametrization Theorem Let E be a projective conic with equation in 
the standard form xy + yz + zx = 0. Then each Point of E, other than 
[1,0,0], has homogeneous coordinates of the form [1? +t,t+ 1,-t], 
where t € R. Moreover, each such Point lies on E. 

8. Pascal’s Theorem Let A, B, C, A’, B’ and C’ be six distinct Points on 

a non-degenerate projective conic. Let BC’ and B’C intersect at P, CA’ 
and C’A intersect at Q, and AB’ and A’B intersect at R. Then P, Q and R 
are collinear. 
Corollary Let A, B, C, A’, B’ and C’ be six distinct points on a non- 
degenerate plane conic, with BC’ and B’C intersecting at P, CA’ and 
C’A intersecting at Q, and AB’ and A’B intersecting at R. Then P, Q and 
R are collinear. 

9. Converse of Pascal’s Theorem Let A, B, C, A’, B’ and C’ be six Points, 

no three of which are collinear, with BC’ and B’C intersecting at P, C A’ 
and C’A intersecting at Q, and AB’ and A’B intersecting at R. If P,Q 
and R are collinear, then the Points A, B, C, A’, B’ and C’ lie ona 
non-degenerate projective conic. 
Corollary Let A, B, C, A’, B’ and C’ be six points in R*, no three of 
which are collinear, with BC’ and B’C intersecting at P, CA’ and C’A 
intersecting at Q, and AB’ andA’B intersecting at R. If P,Q and R are 
collinear, then the points A, B, C, A’, B’ and C’ lie on a non-degenerate 
plane conic. 

10. To locate, using only a straight edge and a pencil, further points on the 
unique plane conic through five given points A, B, C, A’ andB’, no three 
of which are collinear: 

1. draw the lines AB’ and A’B, and so determine the point of intersection 
R of these lines; 

2. for any point T in R? such that AT does not pass through B, C, A’ orB’, 
draw the lines CA’ and TA, and so determine the point of intersection 
Q of these lines; 
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3. draw the lines QR and B’C, and so determine the point of intersection 
P of these lines; 

4. then the point of intersection F' of the lines BP and AT lies on the 
conic. 

11. The equation x* + y? = z? is called a standard form for the equation 
of a projective conic. The conic defined by this standard form is called a 
standard projective conic. It is a right circular cone in R?. 

12. Let E be the projective conic with equation x7 + y* = z*, and let P = 
[a, b,c] be any Point in RP*. Then: 

(a) if P € E, then ax + by — cz = Ois the equation of the tangent to E 
at P; 

(b) if P ¢ E, then ax + by — cz = 0 is the equation of the polar of P 
with respect to E. 

13. La Hire’s Theorem Let E be a non-degenerate projective conic, and let 
P be any Point of RP?, with polar p with respect to E. Then the polar of 
any Point Q on p passes through P. 

14. Three Tangents and Three Chords Theorem Let a non-degenerate plane 
conic touch the sides of a triangle at the points P, Q and R, respectively, 
and let the tangents at P, Q and R meet the extended chords QR, RP and 
PQ at the points A, B and C, respectively. Then A, B and C are collinear. 

15. If E, and £2 are the projective conics with equations xy + yz + zx = 0 
and x? + y? = 2”, respectively, then matrices associated with projective 


2 


1 -1 0 
transformations that map E onto E2 and E2 onto Ej are{ O O 2 
1 dt 2 
-! 1 -!t 
and 1 1 —I1 J], respectively. 
0 -1 O 


16. Parametrization Theorem Each Point of the projective conic with equa- 
tion x2 + y? = 2”, other than the Point [—1,0,1] has homogeneous 
coordinates of the form [1 —17,2r,1+ #|, where t € R. 


Section 4.4: Applying Linear Algebra to Projective Conics 
1. Let E bea projective conic with equation Ax? + Bxy+Cy?4+ Fxz+Gyz+ 


A 3B 35F 
Hz =0. Then A = 5B C 5G is a matrix associated with EL. 
5F 3G H 


The equation of E can be written in the form x’ Ax = 0. 
2. All non-degenerate projective conics are projective-congruent. 


Section 4.5: Duality and projective conics 


1. Principle of Duality Any true statement about Points, Lines and their 
projective properties remains true after dualization. 
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2. When a projective conic is defined by its family of Points, we refer to it as a 
Point conic; when we define a projective conic by its envelope of tangents, 
we refer to it as a Line conic. 

3. Every projective conic is self-dual in the sense that the dual of a Point conic 
is the corresponding Line conic, and vice versa. 

4. Brianchon’s Theorem The diagonals joining opposite vertices of a projec- 
tive hexagon circumscribed around a non-degenerate projective conic are 
concurrent. 

This is the dual of Pascal’s Theorem for projective conics. 


Inversive Geometry 


In this chapter we introduce a geometry known as inversive geometry and use 
it to investigate circles and lines. 

We also prove the Apollonian Circles Theorem: namely, that if A and B 
are two given points in the plane, then all the points P in the plane such that 
PA : PB is a fixed ratio k : 1, for some positive real number k 4 1, lie on a 
circle. Notice that if k = 1, the locus of points P such that PA: PB = 1: 1 
is simply the set of points equidistant from A and B, namely the perpendicular 
bisector of the segment AB. 

So for each pair of points A, B there is a family of circles known as the 
circles of Apollonius. Small values of k yield circles close to A, whereas 
large values yield circles close to B. It is sometimes helpful to regard the 
points A and B as point circles, corresponding to k being zero and ‘infinity’, 
respectively. 

The key tool in our work is inversion, which is a generalization of the notion 
of reflection of points in a line. Just as reflection in a line maps points on one 
side of the line to points on the other, so inversion in a circle maps points inside 
the circle to points outside, and vice versa. 

One difficulty that arises when we tackle problems like the Apollonian Cir- 
cles Theorem is the presence of a line amongst what would otherwise be a 
family of circles. Clearly, it would be convenient if we could tackle such prob- 
lems without having to treat the line as a special case. An ingenious way to do 
this is to think of the line as a circle of infinite radius in which the ‘ends’ of 
the line have been ‘joined’ by an additional ‘point at infinity’. This interpreta- 
tion enables us to introduce the term generalized circle to mean either a line 
or a circle. 

To make the idea of a point at infinity precise, we introduce a space known 
as the extended plane. This consists of the ordinary plane R? together with 
an additional point that we define to be the point at infinity. This extended 
plane is the space in which we study inversive geometry. The transformations 
of this geometry, known as inversive transformations, are defined to be the 
composites of inversions. 

Inversive transformations have the remarkable properties that they preserve 
the magnitude of angles between intersecting curves, and they map generalized 


This surprising fact was 


proved by the Greek 


geometer Apollonius of 
Perga (2nd century BC). 


Apollonian circles 
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circles onto generalized circles. These properties enable us to use inversive 
geometry to prove results like the Apollonian Circles Theorem. 

In Section 5.1, we define inversion and study its basic properties. 

In Section 5.2, we introduce the extended plane, and show how certain trans- 
formations of this plane can be represented using complex numbers. We then 
introduce the idea of a generalized circle, and show that every inversion pre- 
serves the magnitude of angles, and maps generalized circles onto generalized 
circles. We end the section by giving an interpretation of the extended plane in 
terms of the so-called Riemann sphere. On this sphere all generalized circles 
actually look like ordinary circles. 

In Section 5.3, we formally define what is meant by an inversive trans- 
formation and inversive geometry. The group of all inversive transformations 
has a subgroup which consists of the so-called Mébius transformations. This 
subgroup is analogous to the subgroup of direct isometries in Euclidean 
geometry. 

In Section 5.4, we prove the Fundamental Theorem of Inversive Geometry: 
namely, that any three points in the extended plane can be mapped onto any 
other three points by an inversive transformation. We use this result to show 
that, in inversive geometry, all generalized circles are congruent. 

Finally, in Section 5.5 we use inversive geometry to prove the Apollonian 
Circles Theorem and to study various families of circles. 


5.1 Inversion 


5.1.1 Reflection and Inversion 


We begin our exploration of inversive geometry by introducing a type of 
transformation known as inversion. These transformations will be used in 
Section 5.3 to define inversive geometry. 

Roughly speaking, an inversion is a transformation of the plane that gener- 
alizes reflection in a line. Instead of mapping points from one side of a line to 
the other, an inversion maps points inside a circle to points outside the circle, 
and vice versa. 

Recall that under reflection in a line £, a point A is mapped to an image point 
A’ that lies an equal distance from @, but on the opposite side of @. In order to 
generalize this notion of a reflection, we shall reformulate it in a way that 
provides us with a sensible analogue when the line @ is replaced by a circle C. 

To do this, let m be a line parallel to AA’ that crosses £ at some point P. 
Under reflection in £, ZPAA’ maps to ZPA'A, so these two angles must be 
equal. But since the lines m and AA’ are parallel, the angle ZPA’A is equal to 
the angle between PA’ and m, so ZPAA' is equal to the angle between PA’ and 
m. This is the clue that we need to generalize reflection. 

By astretch of the imagination, we can think of ¢ as an infinitely large circle 
with m lying along the radius through P. If we replace £ by a circle C of finite 
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radius, and we replace m by a radial line that meets C at some point P, then, 
by analogy with reflection, we can define the image of A to be the point A’ on 
the line segment OA for which ZOPA’ is equal to ZPAO. We say that A’ is the 
point inverse to A with respect to the circle C. 

Of course, for this definition to work, we must check that for a given point 
A, the position of the point A’ is independent of P. To do this, observe that the 
triangles APOA’ and AAOP are similar, for they have a common angle at O 
and ZOPA' = ZOAP. It follows that 


OA’ _ OP 
OP OA’ 
SO 
OA - OA’ = OP”. 


But OP is equal to the radius r of C, so 
OA - OA! = r°. (1) 


Since there is only one point A’ on the line segment OA that satisfies equa- 
tion (1), and since the equation does not depend on P, it follows that the 
position of A’ does not depend on the choice of P. 

Although the above construction illustrates how inversion can be defined 
as a generalization of reflection, it is worth noting that equation (1) is all 
that we need to determine the point A’ that corresponds to a given point A. 
For simplicity, we shall therefore use equation (1) as our formal definition of 
inversion. 


Definitions Let C be a circle with centre O and radius r, and let A be any 
point other than O. If A’ is the point on the line OA that lies on the same 
side of O as A and satisfies the equation 


OA: OA! =r’, (2) 


then we call A’ the inverse of A with respect to the circle C. The point O is 
called the centre of inversion, and C is called the circle of inversion. The 
transformation ¢ defined by 


t(A) = A’ (4 eR? - (0}) 


is known as inversion in C. 


Remark 


Since OA - OA’ = r? is non-zero, neither OA nor OA’ can be zero, and so 
neither A nor A’ can coincide with O. It is for this reason that O is excluded 
from the domain of the transformation ¢ since there is no point to which O can 
be mapped. Likewise, there is no point that maps to O. 
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We can sometimes write down the inverse of a point directly from the above 
definition. For example, if C is the unit circle {(, y)ix27+y= 1}, then 


the inverse of (0, 2) with respect to C is the point (0, 5), and the inverse of 
(-1. 0) is the point (—3, 0). 


Problem 1 Write down the inverse of each of the following points 
with respect to the circle of unit radius, centred at the origin. 


@ 40) 600 ©(0-$) W409) 


Inversion distorts the plane considerably, for it maps points inside a circle C 
to points outside C, and vice versa. Indeed, if OA < r, then OA’ = r2 /OA>r, 
whereas, if OA > r, then OA’ = r?/OA < r. Any point that lies on C maps to 
itself. 


OA < r OA > fr OA =r 


Note that if A’ is the inverse of A, then A must be the inverse of A’, for if 
OA - OA’ = r? then clearly OA’ - OA = r*; we say that A and A’ are inverse 
points with respect to C. In this sense, inversion is like reflection; if we reflect 
a point in a line and then reflect the reflection, we obtain the original point 
back again. Any transformation f that has this property is said to be self-inverse 
because it shows that t~! exists and is equal to f. 


Theorem 1_ Inversion in a circle is a self-inverse transformation. 


Since any transformation that has an inverse is one-one, it follows that 
every inversion is a one-one transformation. Remember, however, that since 
O is excluded from its domain, an inversion is a one-one transformation of 
IR? — {O} onto itself. 

Earlier, we mentioned that inversion can be regarded as a generalization of 
reflection. To see how this follows from the definition of inversion, we examine 
what happens to the inverse of a point A as we increase the radius of the circle. 

Let A be a point outside a circle C with centre O and radius r, let A’ be the 
inverse of A with respect to C, and let the line segment AA’ meet the circle at 
N. Then OA = r + AN and OA’ = r — A‘N, so the equation OA - OA’ = r? 
becomes 


(r + AN)(r — A'N) =r’. 
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After expanding the brackets, cancelling the r* terms, and solving for A’, we 


obtain 
7 AN-r AN 


~ r+AN 1+AN/r° 

Now fix A and N, and let the radius r of the circle tend to infinity. As it 
does so, it follows from the above equation that the length of A’N tends to the 
length of AN. In other words, reflection in a line can be regarded as the limiting 
case of inversion in circles of increasing radii. For this reason, we adopt the 
following useful convention. 


A'N 


Convention We use the term inversion to mean either reflection in a line 
or inversion in a circle. 


The following gives a geometric method for constructing inverse points with 
respect to a given circle. 

Let A be a point outside a circle C with centre O and radius r, let AP and 
AQ be the two tangents from A to C, and let A’ be the point of intersection of 
OA and PQ. Then A and A’ are inverse with respect to C. 

For, in the triangles AOPA’ and AOAP, the angles at O are equal and the two 
angles ZOA’ P and ZOPA are equal, since they are both right angles; hence all 
the angles in the two triangles are equal. Thus the triangles are similar; hence, 
in particular, we must have that 


OA’ OP 
OP OA’ 
so that 
OA: OA! = OP? =r’. 
Thus A and A’ are inverse with respect to C, as claimed. 

We end this subsection by showing that an inversion in a circle is a dif- 
ferent type of transformation from those that arise in Euclidean and affine 
geometry. 

First, an inversion in a circle is not an affine transformation since it does not 
map lines to lines. For example, let C be the unit circle and £ be the line with 
equation x = 2. All the points on ¢ lie outside the unit circle, and so inversion 
in C must map all the points of @ to points that lie inside the unit circle. In 
particular, the image of & cannot be a line. Thus, since the image of a line 
under any affine transformation is itself a line, it follows that inversion cannot 
be an affine transformation. 

An inversion in a circle is not a Euclidean transformation either, for like 
affine transformations, Euclidean transformations map lines to lines. Indeed, a 
Euclidean transformation is just a special type of affine transformation. 


5.1.2. The Effect of Inversion on Lines and Circles 


In the previous subsection you saw how to construct the point A’ that is the 
inverse of a given point A with respect to a given circle C. However, in order 
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to study many of the properties of inversion, we require an algebraic formula 
that relates the coordinates of A and A’. 

For our present purposes it is sufficient to derive the formula for the case 
where C is the unit circle {(x, yix27+y= i}. This circle will occur so 
frequently in our work that we denote it by the special symbol @. 

Let A be a point (x, y) € R?—{O}, and let A’ be its image under inversion in 
the unit circle @. Since A’ lies on the same half-line from the origin as does A, 
it follows that A’ must have coordinates (kx, ky), for some positive number k. 

Since the radius of @ is 1, we must have OA- OA’ = 1. Thus OA?- OA” = 1, 
and so 


A(kx, ky) 


(7 at y’) (x? + ky?) =f 
It follows that 
1 


= — = 
(x? +?’ 


and hence that 
_ 1 
a x2 + y2 7 


Thus A’ is the point (ais, ote), We therefore have the following 
algebraic description of inversion in the unit circle. 


Theorem 2 Inversion in the unit circle @ is the function 


: ab D4 a 
t:(@,y)e (at a) (@» eR (0}). 


We may use this theorem to find the image of any non-zero point of R* under 
inversion in the unit circle @. For example, the image of (3, —2) is the point 


3 —2 =(2 2) 
32 + (—2)2’ 32 + (—2)2 “~4\ 13? 137" 


Problem 2 Determine the image of each of the following points under 
inversion in @. 


(a) (4.1) (b) (3-4) 


Now let (x’, y’) be the image of (x, y) under inversion in @. Since inversion 
is self-inverse, it follows that 


(x, y) = ( i ? y ) 
GPa Gta yy 


Inversion 


Using this relationship between (x, y) and (x’, y’), we can find the image of a 
curve under inversion in @ in much the same way as we found the image of 
a curve in affine geometry. The main difference is that we must be careful to 
remember that inversion is not defined at the origin (the centre of inversion) 
and that no point is mapped to the origin. 


Example1 Determine the image under inversion in @ of the line 2x+4y = 1. 


Solution Let (x, y) be an arbitrary point on the line 2x + 4y = 1, and let 
(x’, y’) be the image of (x, y) under inversion in @. Then 


( ) ( x’ y’ ) 

x,y)= ‘ . 

NCP OD G+ OF 

Since x and y are related by the equation 2x + 4y = 1, it follows that x’ and 
y’ are related by the equation 


2x" n 4y’ zg 
a+)?  @ P4072 ~ 
Multiplying by (x’)* + (y’)?, we obtain 2x’ + 4y’ = (x’)* + (y’)?, and by 
completing the square we may write this as 


(x! = 1)? + (y' — 2)? =5. 


Dropping the dashes, we see that points on the image must have (x, y)- 
coordinates which satisfy the equation 


(x — 1)? + (y—2)? =5. 


This is the equation of a circle with centre (1, 2) and radius /5. Since this 
passes through the origin, and since the origin cannot be part of the image 
under the inversion, the required image must be the circle with the origin 
removed. 


In this example we used dashes to distinguish the coordinates of a point 
(x,y) from the coordinates of its image (x’, y’). However, once you have 
understood the method you may prefer to adopt the strategy below. This uses 
the same method as in the example, but it drops the dashes throughout. 


Strategy To determine an equation for the image of a curve under 
inversion in the unit circle @: 


1. write down an equation that relates the x- and y-coordinates of points on 
the curve; 
2. replace x by ee and y by Fe? and simplify the resulting equation. 
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We shall return to the 
question of what happens 
at the origin later, in 
Subsection 5.2.3. 


For every point on 

(x - 1? +(y- 1 =5 
other than (0, 0) is the 
image of some point on 
2x+4y= 1. 
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When using this strategy to find the image of a curve that passes through the 

origin, we must first remove the origin from the curve. We may also have to 
exclude the origin from 
the image. 


Example 2 Determine the image under inversion in @ of the line y = x (with 
the origin removed). 


Solution Replacing x by yt and y by we obtain 


par ee, 

pe 
y x 

x2 +y? x2 + y2" 


Hence 
y=x. 


This is the line we started with. Just as the origin had to be excluded from that 
line before we could find its image, so the origin has to be excluded from the 
image. Hence the line y = x (with the origin removed) maps to itself under 
inversion in @. 


We shall use the term punctured to describe a line or circle that has one of 
its points removed. More specifically, if A is the point removed from a line or 
circle, then the line or circle is said to be punctured at A. 


Problem 3 Determine the image under inversion in @ of the line 
y = 2x punctured at the origin. 


Problem 4 Let @ be the line with equation x + y = 1. 


(a) Determine the image of @ under inversion in @. 
(b) Explain why the points (1, 0), (0, 1) lie both on @ and on its image. 
(c) Sketch @ and its image on a single diagram. 


The solutions to these problems and the preceding examples illustrate the 
following general result. 


Theorem 3 Images of Lines under Inversion 
Under inversion in a circle with centre O: 


(a) a line that does not pass through O maps onto a circle punctured at O; 
(b) a line punctured at O maps onto itself. 


Proof First choose a pair of coordinate axes with origin at O, and choose a 
unit of length equal to the radius of the circle in which we are inverting. Then 
the circle in which we are inverting becomes the unit circle @. 


(a) If € is a line that does not pass through the origin, then it has an equation 
of the form 
ax + by+c=0, 
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where c is non-zero. Using the above strategy, we know that the image of 
£ under inversion in @ has equation 


ax by 


=0. 
pag Page” 


BY 


Since c is non-zero, we may rewrite this in the form 
x? + y* + (a/c)x + (b/c)y = 0. 


This is the equation of a circle C through the origin. If the origin is 
removed from this circle, then each remaining point A’ is the image of , : 


the point A at which OA’ intersects ¢. It follows that the image of ¢ is the a TF 

whole of the punctured circle C — {O}. g ~ 
(b) Although we could use the strategy to prove this part as well, it is easier a | 

to work directly from the definition of inversion. Indeed, if @ is a line 

punctured at O, then each point of @ inside @ is the image of a point of 

£ outside @, and each point of £ outside @ is the image of a point of @ 


inside @. Points of £ that lie on @ map to themselves. It follows that the 
punctured line £ maps onto itself, as required. | 


Next, we consider the images of circles under inversion. Since points on the 
circle of inversion map to themselves, the image of that circle is also a circle. 
Also, any circle C with centre the centre of inversion O must map onto another 
circle C’ with centre O, for, by symmetry, every point of C is mapped an equal 
distance along a radial line. 

This raises the question as to whether an inversion always maps circles to 
circles. 


Example 3 Use the above strategy to determine the image under inversion in 
© of the circle C with centre (2, 0) and radius 1. 


Solution The circle C has equation (x —2)?+ y? = 1, which we may rewrite 
in the form 


x7 + y?—4x4+3=0. 


Using the above strategy, we deduce that the image of C under inversion in @ Note that the origin does 
has equation not lie on C. 


2 2. 
x 4 y 4x 43=0 
x2 + y? x24 y2 x2 + y?2 — = 


We may add together the first two terms of this equation to obtain 


1 4x 
x2 +y? x2 + y? 
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which we may rearrange in the form 


4 


v+y—4x44=0. 


By completing the square we obtain 


(-) +4=()) 


This is the equation of a circle with centre (3. 0) and radius i. 


So in this example the circle C does indeed map to another circle. Notice, 
however, that the centre (2, 0) of C maps to ( 5 0) which is not the centre of the 
image of C. It follows that even if an inversion maps one circle onto another, 
it may not map the centres to each other. 


Problem 5 Determine the image under inversion in @ of the circle 
with centre (2, 2) and radius 1. 


The next example illustrates what happens when we use the strategy to find 
the image of a circle that passes through the origin. 


Example 4 Let C be the circle with centre (—2, 0) and radius 2, punctured at 
the origin. Determine the image of C under inversion in @. 


Solution The circle C has equation (x + 2)* + y? = 2*, which we may 
rewrite in the form 
ge 4 y* fax = 0. 


Using the above strategy, we deduce that the image of C under inversion in @ 


has equation 
ee hy eee eee 
x24 y2 x2 + y?2 x24 y2 . 


Adding together the first two terms of this equation, we obtain 


1 4x 


= 0, 
Pan) Bae 


which we may rearrange in the form | + 4x = 0. 

It follows that the image of the punctured circle C is the line @ with equation 
re -i. From the figure in the margin it is clear that every point of ¢ is the 
image of some point on C, so there is no need to puncture @. 


Problem 6 Let C be the circle with centre (0. -}) and radius i 


punctured at the origin. Determine the image of C under inversion in @. 
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The conclusions of Examples 3 and 4, and of Problems 5 and 6, suggest the 
following result. 


Theorem 4 Images of Circles under Inversion 
Under inversion in a circle with centre O: 


(a) acircle that does not pass through O maps onto a circle; 
(b) acircle punctured at O maps onto a line that does not pass through O. 


Proof As for Theorem 3, we choose a pair of coordinate axes that makes the 
circle in which we are inverting the unit circle @. 

Now let C be an arbitrary circle with centre (a,b) and radius r. This has 
equation 


@-aP tb =r’, 
which we may rewrite in the form 


x? + yr — 2ax —2by+c=0, Note that this passes 
through O if and only if 
where c = a* + b* — r*. Using the strategy for determining the images of ¢ = 0. 
curves under inversion, we deduce that the image of C under inversion has 
equation 


x 2 4 y ‘i 2ax 2by a 0 
c= U. 
x2 + y?2 x24 y?2 x24 y? x2 + y? 


Adding together the first two terms of this equation, we obtain 


1 2ax 2by 


= 0. 
Poy x+y pay 


By multiplying this equation by (a? + y) , we may rearrange it in the form 
1 — 2ax — 2by +(x? +9?) =O. (3) 


This is the equation of either a line or a circle, depending on whether or not C 
passes through O. 


(a) If C does not pass through O, then c is non-zero. We can therefore divide 
equation (3) by c to obtain 
x? ty?—24x-22y4+i=0. 


This is the equation of a circle on which the image of C must lie. 


272 5: Inversive Geometry 


(b) If C does pass through O, then c = 0, so equation (3) becomes 
1 — 2ax — 2by = 0. 
This is the equation of a line @ that does not pass through O. a 


The following box summaries the results of Theorem 3 and 4. 


Under inversion with respect to a circle with centre O: If the circle of inversion is 
the unit circle @, then O 


a line punctured at O maps onto the same line punctured at O; is the origin. 


a line not through O maps onto acircle punctured at O; 
a circle punctured atO maps onto a line not through O; 
acircle not through O = =maps onto acircle not through O. 


There is no need to remember the details of this summary since its predic- 
tions can be recalled intuitively as follows. First, points on a line or circle 
through the origin can be chosen arbitrarily close to the origin. The images 
of such points can therefore be chosen arbitrarily far from the origin, and 
must therefore lie on a line. Secondly, points on a line can be chosen arbitrar- 
ily far from the origin. The images of these points can be chosen arbitrarily 
close to the origin and must therefore lie on a circle or line punctured at 
the origin. 

With a little practice it is easy to use the summary to simplify the work 
needed to determine the image of a circle or line under an inversion. 


Example 5 Determine the image of each of the following under inversion in 
the unit circle @: 


(a) the line @ with equation x = 2; 
(b) the circle C with centre (0, 2) and radius 1. 
Solution 


(a) From the summary, we know that £ maps to a circle C punctured at 


Y 


the origin. This circle passes through the point (G: 0), since ( 0) 
is the image of the point (2, 0) on the line. Since @ is symmetrical about 
the x-axis, it follows that the image circle must also be symmetrical 
about the x-axis. The only circle C that fulfils all these criteria is the circle 
with radius i and centre (j 0). 

(b) From the summary, the image C’ of C is a circle that does not pass through 
the origin. It must be symmetrical about the y-axis (because C is), and it 


must pass through the points (0. 4) and (0, 1) (the images of (0, 3) and 


HY 


(0, 1), respectively). The only circle C’ that fulfils all these criteria is the 


circle with radius 5 and centre (0. 2). oO 
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Problem 7 Determine the image of each of the following under 
inversion in the unit circle @: 


(a) the line @ with equation y = 1; 
(b) the circle C with centre (0, 1) and radius | (punctured at the origin). 


5.1.3 


In this subsection we shall show that inversion preserves the magnitude of the 
angle at which two curves meet. First, however, we must clarify what it means 
to measure the angle between two intersecting curves. 

In the case of two intersecting lines 2; and 2 there are two ways in which we 
can measure the angle from £; to 02. We can measure it either in a clockwise 
direction, or in an anticlockwise direction, as shown in the margin. Clearly, 
the magnitude of the angle depends on the direction we choose, so when 
specifying an angle we must give both its magnitude and direction. 

In the case of two intersecting curves, we define the angle between the 
curves by using tangents, as follows. 


The Effect of Inversion on Angles 


Definitions Let c; and c2 be two curves that intersect at the point A, and 
let the tangents to the curves at A be ¢; and €2, respectively. Then the 
anticlockwise angle from c, to cz at A is the anticlockwise angle from 1 
to £2, and the clockwise angle from c; to cz at A is the clockwise angle 
from £, to £9. 


To examine what happens to the angles between two curves under an inver- 
sion, it is sufficient to examine what happens to the angles between the 
corresponding tangents. 

For the moment let us concentrate on what happens to a single line @ under 
inversion in a circle centred at O. We know that € maps onto a circle C 
punctured at O. But we can say more, for if m is the line through O that is 
perpendicular to £, then @ is symmetrical about m. It follows that the circle C 
is symmetrical about m, and so € is parallel to the tangent to C at O. We state 
this result as the Symmetry Lemma. 


Lemmal Symmetry Lemma 
Let £ be a line that does not pass through the point O. Then under inversion 
in a circle with centre O, & maps to a circle C (punctured at O), and the 
tangent to C at O is parallel to @. 


Now consider what happens to the angle between two lines £; and €2 which 
intersect at some point A other than O, as shown below. For the moment we 
shall assume that neither line passes through O, so that under the inversion 
the lines map to punctured circles C; and C2, respectively. These punctured 
circles meet at the point A’, where A’ is the image of A. We want to compare 
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anticlockwise angle 


1 
clockwise angle 


, 


anticlockwise angle 


t, 
clockwise 
angle 


We ask you to investigate 
what happens when one of 
the lines passes through O 
in Problem 8. 
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the angle from £; to 2 at A with the angle from C, to C2 at A’. The trick we 
use is to compare both angles with the angle from C, to C2 at O. 


a 


sy 


By the Symmetry Lemma, ¢ is parallel to the tangent m, to C; at O, and 
£ is parallel to the tangent m2 to C2 at O. It follows that the angle from ¢; to 
£9 must be equal in magnitude and direction to the angle we have shown from 
m, tomo. 

Next observe that the reflection in the line through the centres of C, and C2 
sends the tangents m , mz at O to the tangents n1, n2 at A’. Since the reflection 
preserves the magnitude of an angle but changes its orientation, we conclude 
that the angle from n; to nz at A’ must be equal in magnitude but opposite in 
orientation to the angle from m, to mz at O. 

Overall, we have shown that the angle from C; to C2 at A’ must be equal in 
magnitude but opposite in orientation to the angle from ¢, to €2 at A. 


Theorem5 Angle Theorem 
An inversion in any circle preserves the magnitude of angles between curves 
but reverses their orientation. 


The next problem asks you to complete our proof of this theorem. 


Problem 8 Prove the Angle Theorem in the case where one of the 
lines passes through the centre of inversion. 


The Angle Theorem provides us with a very powerful tool for locating the 
images of two or more circles or lines under inversion. 


Example 6 A family of circles shares a common tangent at the origin O, as 
shown in the margin. Describe the effect of inverting the family of circles (all 
punctured at O) in the unit circle @. Illustrate your answer with a sketch. 
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We sometimes abbreviate 
this by saying that the 
angle at A’ is equal but 
opposite to the angle at A. 


Inversion 


Solution Let ¢ be the common tangent to the circles, as shown on the left 
below. If d is the punctured line through O perpendicular to ¢, then each of the 
circles crosses d at right angles. Since all the circles are punctured at O, their 
images under the inversion must be straight lines, as shown on the right. By the 
Angle Theorem, these straight lines must cross the image of d at right angles. 
But d maps onto itself under the inversion, so the punctured circles map to a 
family of parallel lines perpendicular to d. 


ey 
sy 


In fact, apart from £, every line perpendicular to d must be the image of a 
punctured circle in the family. For if a perpendicular line meets d at some point 
A’, then it must be the image of the punctured circle that passes through the 
point A which is mapped to A’ under the inversion. 


Problem 9 One family of circles touches the x-axis at the origin O, 
and another family of circles touches the y-axis at O. Describe the effect 
of inverting the two families of circles (all punctured at O) in the unit 
circle @. Illustrate your answer with a sketch. 


Problem 10 A family of circles intersects at the origin O and at 
another point A, as shown in the margin. Describe the effect of inverting 
the family of circles (all punctured at O) in the unit circle @. Illustrate 
your answer with a sketch. 


Problem 11 Let C,, C2 and C3 be circles in the plane such that C; 
and C» touch at the origin O, C3 and C, touch at another point A, and 
C2 and C3 touch at the further point B. Describe the effect of inverting 
the circles C,;, C2 (both punctured at O) and C3 in the unit circle @. 
Illustrate your answer with a sketch. 


These problems illustrate that, by carefully choosing centres of inversion, 
we can transform some of the circles in a figure to straight lines. Since straight 
lines are often easier to deal with than are circles, we can use such transformed 
figures to investigate those properties of the original figures that are preserved 
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by inversions. In essence, this is the idea that underlies inversive geometry, and 
you will see several example of its use in Section 5.5. 


5.2 Extending the Plane 


In order to define a geometry in which we can study the properties of cir- 
cles, lines and angles, we need a group of transformations that preserve these 
properties. Among the transformations that do this are the Euclidean transfor- 
mations, and the inversions introduced in the previous section. In this section 
we describe how such transformations can be represented in terms of complex 
numbers. This will enable us to manipulate the transformations by using the 
algebra of complex numbers. 


5.2.1. Transformations of the Complex Plane 


We begin by reminding you of some facts concerning complex numbers. 

First, there is a one—one correspondence between points (x, y) in the plane 
IR? and complex numbers z = x+iy in the complex plane C; we call x and y the 
real part and the imaginary part of the complex number z, and denote them 
by the symbols ‘Re z’ and ‘Im z’, respectively. All the arithmetic operations 
may be carried out in C as for real numbers, except that we replace i by —1 
wherever i* occurs. 


If z is the complex number x + iy, then its conjugate Z is defined by 
Z=x-—iy, 


and its modulus |z| is defined by 


Iz) =y/x24 y?. 


Recall that |z|? = zz. 
Problem 1 Let z; = 2 — 3i and z2 = —3 4+ 4i. 
(a) Determine each of the following complex numbers in Cartesian 


form. 


(i) z1 + z2 (ii) Z1 — Z2 (iii) 2122 
(iv) 21/22 (v) Zr (vi) Z2 


(b) Determine |z;| and |z2|. 
If a non-zero complex number z = x + iy has modulus r, and if the position 


vector of the point (x, y) lies at an angle 6 to the positive x-axis, then we can 
express z in the form 


r(cos@ +isin@). 
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Recall that x + iy is the 
Cartesian form of the 
complex number. 


z=xtiy 


Zax-iy 


z=r(cos 0+isin 0) 


Extending the Plane 


Such an expression is known as a polar form of z, and the angle 0 is known 
as an argument of z, written arg z. Polar forms and arguments are not unique, 
since the same z could equally well be expressed in the form 


r(cos(@ + 27n) +i sin(@ + 27n)) 


for any integer n. The principal argument of z, written Arg z, is the unique 
value of arg z that lies in the interval (—z, 7]. 

We can obtain the Cartesian form of a complex number from any of its polar 
forms by using the equations 


x=rcos@é and y=rsing, 


and we can obtain a polar form of a complex number from its Cartesian form 
by using the equations 


x 
r= /x2+y?, cos@=— and sino =~. 
r r 


Problem 2 Determine the polar forms of the complex numbers 
zy = 1 —i and z. = —V3 +i in terms of their principal arguments. 


The following strategy can be used to multiply and divide complex numbers 
given in polar form. 


Strategy To multiply two complex numbers given in polar form, multiply 
their moduli and add their arguments. 

To divide two complex numbers given in polar form, divide their moduli 
and subtract their arguments. 


Although this strategy gives an argument for a product or quotient, you 
may require the principal argument, in which case you will need to adjust 
the argument by adding an appropriate multiple of 27. 


Problem 3 Let z} = 1 —i and z = —J3 +i. Determine the 
polar forms of the following complex numbers in terms of their principal 
arguments. 


(a) z1z2 (b) z1/z2 


We can now use the algebra of complex numbers described above to 
represent some of the basic Euclidean transformations of the plane. 


Translations 


First, consider the transformation 


t(iz)=zte (zECQ), (1) 
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where c = a + ib. This maps an arbitrary point x + iy € C to the point 


ztec 
(x +a) +i(y +5), 
b 
and therefore corresponds to a translation through the vector (a,b). Clearly, z 
such transformations preserve angles, and map circles and lines to circles 
and lines. : 


Reflection in the x-Axis 


Next, consider the transformation 


tzy=z (EC). (2) LA | 
<i 


This maps an arbitrary point x + iy € C to the point x — iy, and therefore 
corresponds to a reflection in the x-axis. It maps circles and lines to circles 
and lines, and it preserves the magnitude of angles; however, the orientation of 
angles is reversed. 


“| @<—_—___}__ pen 


Rotation about the Origin 


Now, consider the transformation 


t(z) =az (ze), (3) 
where |a| = 1. Since |a]| = 1 we can write a = cos@) + isin, where 
6) = Arg a. From the above strategy, t maps an arbitrary point r (cos 6+ sin 0) 
in C to the point 


r(cos(@ + 69) +7 sin(@ + 6)), 


and therefore corresponds to a rotation through the angle 6) = Arga about the 
origin. The rotation is clockwise if Arg a < 0 and anticlockwise if Arg a > 0. 


Arbitrary lsometries 


All the other isometries can be represented in the complex plane as composites 
of the basic transformations described above. In fact, we have the following 
result. 


Theorem 1 Each isometry ¢ of the plane can be represented in the 
complex plane by one of the functions 


t(z)=az+b or t(z)=az+b, 


where a,be€C, |a|=1. Conversely, all such functions represent 
isometries. 


Proof The converse is easy to prove, because every function of the type 
described is a composite of the basic isometries described above. 
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So let t be an isometry of the complex plane, and let t(0) = b, t(1) = c. If 
we denote c — b by a, then |a| is the distance between £(0) and t(1), and since 
t is an isometry it follows that |a| = 1. Now let s be the isometry defined by 
s(z) = az+); then 


s(0)=b=t(0) and s(1)=a+b=c=X(l). 


Thus s~! of is an isometry that fixes 0 and 1, and so, since it is an isometry, it 
must fix each point of the x-axis. 

Then using the fact that s~! o ¢ is an isometry (that is, it does not alter 
distances) that leaves each point of the x-axis unaltered, we may deduce that 
s—! ot is EITHER the identity transformation OR a reflection in the x-axis. If 
it is the identity, then s—! ot(z) = z, in which case f(z) = s(z) = az+b. If it 
is a reflection, then s~! 0 f(z) = Z, in which case ¢(z) = s(Z) = aZ+b. a 


Theorem | may be interpreted as saying that every isometry of the com- 
plex plane can be obtained as a rotation (through an angle Arg a) followed 
by a translation (through the vector (Re b, Im b)), possibly all preceded by a 
reflection in the real axis. 

Slightly more surprising is the fact that every isometry of the plane can be 
expressed as a composite of reflections alone. This is because each rotation 
and each translation can be expressed as a composite of two reflections. 


rotation 
through @ 


~+— 


Thus a rotation about a point O can be expressed as a reflection in any line 
through O followed by a reflection in a second line through O, where the angle 
from the first line to the second line is half the desired angle rotation. 

Similarly, a translation can be expressed as a reflection in any line perpendic- 
ular to the direction of the translation followed by a reflection in a second line 
that is parallel to the first and in the direction of the translation relative to the 
first line. The distance between the lines is half the distance of the translation. 

So we have the following result. 


Theorem 2 Every isometry can be expressed as a composite of reflections. 


Example 1 Let ¢ be the isometry defined by 
t(z) =iz+44+2i (zeEC). 
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(a) Show that f represents an isometry. 
(b) Interpret ¢ as the composite of a reflection, a rotation and a translation. 
(c) Interpret ¢ as a composite of reflections. 


Solution 


(a) The coefficient of z is i, which has modulus |i| = 1. By Theorem 1, it 
follows that f is an isometry. 

(b) In the formula that defines t, the conjugation corresponds to a reflection in 
the x-axis, the multiplication by i corresponds to an anticlockwise rotation 
through z/2, and the addition of 4+2i corresponds to a translation through 
the vector (4, 2). 

(c) First, let r be the reflection in the x-axis that corresponds to the 
conjugation. 

Next, observe that the anticlockwise rotation through 2/2 can be inter- 
preted as the composite r; or, where r is the reflection in the x-axis again, 
and r; is the reflection in the line y = x through the origin that makes an 
angle 2/4 with the x-axis. 

Finally, observe that the translation through the vector (4, 2) can be 
interpreted as the composite r3 o r2, where rz is the reflection in the 
line 4x + 2y = 0 through the origin that is perpendicular to the vector 
(4, 2), and 73 is the reflection in the parallel line 4x + 2y = 10 that passes 
through 5 (4,2) = (2,1). 

Overall, we have t = r3 0r2 or, Oror or, since r is its Own inverse, 


t=73072 074. 


Problem 4 Let ¢ be the transformation defined by 
t(z)=—iz+6-4i (zeEC). 
(a) Show that f represents an isometry. 
(b) Interpret t as the composite of a rotation and a translation. 
(c) Interpret f as a composite of reflections. 
Hint: Use the decompositions described in the margin. 


Having discussed isometries, we now turn our attention to two other trans- 
formations of the complex plane that preserve the magnitude of angles and 
map circles and lines to circles and lines. 


Scalings 
The transformation defined by 

t(z) =kz (z€C), (4) 
where k is real and positive, multiplies the modulus of each complex number 
by a factor k but leaves its argument unchanged. It is therefore a scaling by the 


factor k. Clearly, scalings preserve angles, and map circles and lines to circles 
and lines. 
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The decompositions 
illustrated above 
generalize as follows. 


To rotate the plane R? 
through an angle 6: first 
reflect in the line 

{z : Arg z = O}, then 
reflect in the line 

{z: Arg z = 46}. 

To translate the plane R?2 
through a vector (a, b): 
first reflect in the line 
{(x, y) ax + by = 0}, 
then reflect in the line 
{(x, y) :ax+ by = 

5 (a + b*)}. 


Extending the Plane 


Inversions 


The primary reason for introducing complex numbers into our discussion is 
that they provide a particularly convenient way in which to represent the effect 
of an inversion on points in the plane. 


Theorem 3 An inversion in a circle C of radius r with centre (a,b) may 
be represented in the complex plane by the transformation 


2 


NO een (ele ten 
aa Gi 


where c = a + ib. 


Proof We first consider the case where C is the unit circle @. The image 
under inversion in @ of the point (x, y) is the point (ai. aia), We may 
reformulate this expression in terms of complex numbers, by using the fact that 
the modulus |z| of a complex number z = x + iy satisfies the identity 


a yr = (z|? = ZZ. 


Thus the image under inversion of the point z = x + iy is the point 


x . oy x +iy Zz 1 


+i = = z= 
x2 py? x2py2 x24 y2 2% Z 
Next we consider the general case where C is a circle of radius r with centre 
(a,b). In this case the inversion in C can be expressed as the composite t = 
t3 0 f2 o fy, where 
th(z) = << is a translation and a scaling that sends C to the unit circle; 
t2(z) = 1/z is the inversion in the unit circle; 


3(z) = rz+c is the inverse of t; and sends the unit circle back to C. 


Then 


r 
t(z) = fof of}(z) = —— + ¢, 
x 


as required. | 


The representation of inversion provided by Theorem 3 has a particularly 
simple form in the case where C is the unit circle @. In that case, r = 1 and 
c = 0 so the inversion is represented by the transformation 


t(z) = (ze C—{O}). 


ATL Re 
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By Theorem 2, 
Subsection 5.1.2 


You met this formula in 
the proof of Theorem 3 
above. 
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For example, the image of the point 1 — 7 under inversion in the unit circle @ 
is the point 
a 1-i 
T-i 14% G4)0=9 


II 
NI 
— 
= 
~ 
wm 


Problem 5 Determine, in Cartesian form, the image under inversion 
in the unit circle @ of each of the following points. 


(a) -V3+i = (0) -3 - 47 


Problem 6 Let C be the circle of radius 2 with centre at the origin. 
Write down the inversion in C as a transformation of C — {O}. 


We know that inversions preserve the magnitude of angles. But what about 
their effect on circles and lines? Unfortunately, it is not strictly correct to say 
that an inversion maps circles and lines to circles and lines since some of the 
circles and lines may have to be punctured before the map can be carried out. 
We describe how to overcome this complication later, in Subsection 5.2.3. 


5.2.2 Linear and Reciprocal Functions 


We can use the ‘basic’ complex functions described in the previous subsec- 
tion to give a geometric interpretation of many other complex functions. We 
illustrate this for the so-called linear and reciprocal functions. 


Definition A linear function is a function of the form 
t(z)=az+b (EO), 
where a,b € Canda £0. 


Every linear function t(z) = az + b can be decomposed into a composite 
tz o ty where 


t; is the scaling t)(z) = |a|z, 
ty is the isometry f2(z) = (a/la|)z + b. 


The geometrical interpretation of the linear function t depends on how we 
choose to interpret the isometry f2. We can say that the linear function is a 
scaling by the factor |a|, followed by a rotation through the angle Arg (a/|a|), 
followed by a translation through the vector (Reb, Imb). Alternatively, we can 
use Theorem 2 and say that the linear function is a scaling composed with a 
number of reflections. 

Either way, since both the scaling and the isometry preserve angles and map 
circles and lines to circles and lines, it follows that the same must be true of all 
linear functions. 


5: Inversive Geometry 


Note that the technique of 
multiplying both 
numerator and 
denominator by the 
conjugate of the 
denominator is often 
useful. 


By Theorem, 
Subsection 5.1.3 
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Next, we consider the reciprocal function. 


Definition The reciprocal function is defined by 


Ag) = ; (zg €C —{O}). 


This function can be decomposed into the composite f2 0 tj where 


ty is the inversion t;(z) = 4, 
tz 1s the conjugation f2(z) = Z. 


It follows that, geometrically, the reciprocal function can be interpreted as an 
inversion in the unit circle @ followed by a reflection in the real axis, as shown 
below. 


Xz) =t,0t, (2) = Wz 


Since the inversion and the conjugation are both one—one functions that pre- 
serve the magnitude of angles and reverse their orientation, it follows that 
the reciprocal function must be a one-one function that preserves both the 
magnitude and the orientation of angles. 

As in the case of inversions, it is not strictly correct to say that the reciprocal 
function maps circles and lines to circles and lines, because some of the cir- 
cles and lines may have to be punctured before the reciprocal function can be 
applied. We next show how we can overcome this complication by extending 
the complex plane. 


5.2.3 The Extended Plane 


So far, our discussion of the effect of inversion on lines and circles has been 
complicated by the need to puncture those lines and circles that pass through 
the centre of inversion. Also, the need to distinguish between those images 
that are circles and those that are lines makes the description of the inversion 
process somewhat cumbersome. We can deal with both these difficulties in 
a very elegant way by adding an additional point to the plane to obtain the 
so-called extended complex plane. 
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To illustrate the ideas involved, consider the line @ with equation x = 2. 
Recall that inversion in the unit circle @ maps € to a circle C with radius 4 and 
centre (j: 0), punctured at the origin. 

The point (2, 0) is mapped to (5,0), (2, 1) is mapped to (2, 4), (2, 2) is 
mapped to (j is (2, 3) is mapped to (7: =) and, in general, the point (2, y) 


is mapped to the point Creel ae on C. As the point (2, y) moves from 


(2, 0) up the line @, its image under inversion moves from G. 0) around the 
circle C towards the origin in an anticlockwise direction. Similarly, as the point 
(2, y) moves from (2, 0) down the line @, its image under inversion moves from 
( 7 0) around the circle C towards the origin in a clockwise direction. The ‘gap’ 
in the circle C at the origin arises because there is no point on @, or indeed 
any point in R?, that is inverted to the origin. To ‘fill the gap’ we attach an 
additional point at infinity to the plane. 


Definitions The extended plane is the union of the Euclidean plane R? 
and one extra point, the point at infinity, denoted by the symbol oo. When 
we wish to consider the plane as the complex plane C, then we call the 
extended plane the extended complex plane and denote it by the symbol 
C; thus = CU {oo}. 


Remarks 


1. The extended plane and the extended complex plane both consist of the 
ordinary plane together with the point oo. We use both terms interchange- 
ably, depending on whether we wish to think of the ‘ordinary’ points in 
the plane as being represented by pairs of real numbers or by complex 
numbers. 

2. The symbol oo does not represent a complex number and so it should not be 
used in association with arithmetic operations that act on complex numbers. 
For example, oo + 37 is a meaningless expression. 


Having extended the plane in this way, we now extend the definition of inver- 
sion in a circle with centre O. We simply define the image of O to be oo, and 
the image of 00 to be O. Other points are mapped as specified by the definition 
of inversion given in Subsection 5.1.1. 

With this extended definition of inversion, the point O corresponds to oo, 
and so the circle C in the above discussion now corresponds under inversion 
in @ to the set £ U {oo}. This certainly fills the ‘gap’ in C, but how can the set 
£U {co} be interpreted? 

As a point A on the circle C moves anticlockwise towards O, its image A’ 
under inversion moves up £. When A reaches O, A’ reaches oo. As A continues 
around C below O, its image returns up @ from below. You can think of the 
point oo as ‘linking’ the two ends of the line @, thereby enabling points to 
travel ‘round and round’ the line. With some stretch of your imagination, you 
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Example 5, part (a), 
Subsection 5.1.2 


C 
C 4(2,y) 
\ a a 


c is read as ‘C hat’. 


When we wish to think of 
points as complex 
numbers, we often denote 
them by lower-case letters 
such as z or a rather than 
the upper case letters 
usually used for points. 


A more formal definition 
of inversion in C is given 
below. 
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can therefore think of the line @ as a circle of infinite radius, where the “gap at 
infinity’ has been filled by the point oo. 

Any line @ in the plane may have its ‘gap at infinity’ filled by forming 
the set € U {oo}. Such a set is called an extended line. Since an extended 
line can be thought of as a circle of infinite radius, we make the following 
definition. 


Definition A generalized circle in the extended plane is a set that is either 
a circle or an extended line. 


Remark 


With this definition, you can think of an extended line as a generalized circle 
which passes through oo, and you can think of an ordinary line as a generalized 
circle which has been punctured at oo. 

Recall that in Subsection 5.1.1 we adopted the convention that an inversion 
is either a reflection in a line, or an inversion with respect to a circle. We can 
now regard both of these as inversions in generalized circles. 


Definition Let C be a generalized circle in the extended complex plane. 
Then an inversion of the extended plane with respect to C is a function ¢ 
defined by one of the following rules: 


(a) if C is acircle of radius r with centre O, then 


the inverse of A with respect toC, if A € C — {O}, 
t(A) = { ~, it Al = ©, 
O, iit Al = Coe 


(b) if C is an extended line ¢ U {oo}, then 


the reflection of Ain?, if A €C, 
oo, it Al = 6S. 


(Ay) = | 


Remark 


Note that any inversion in an extended line fixes the point at infinity. Con- 
versely, every inversion that fixes oo must be an inversion in an extended line. 

The above definition of inversion ensures that generalized circles map to 
generalized circles. Indeed, we already know from Subsection 5.1.2 that this is 
true if we allow the circles to be punctured, but we still need to check that the 
gap in a punctured circle is mapped to the gap in its image circle. 

In the case of inversion with respect to a circle with centre O, we know that 
a circle or line punctured at O maps to a line, and this is consistent with the 
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Remember that A’ is the 
inverse of A with respect 
to C if it lies on the same 
radial half-line from O as 
A, and OA - OA’! = r?. 
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fact that O maps to oo. Also, a line maps to a circle or line that is punctured at 
O, and this is consistent with the fact that oo maps to O. 

In the case of inversion with respect to an extended line, we know that lines 
reflect to lines, which is consistent with the fact that oo maps to oo. Ordinary 
circles are not a problem since they reflect onto circles. 

We therefore have the following important result. 


Theorem 4 _Inversions of the extended plane map generalized circles onto 
generalized circles. 


We shall sometimes find it convenient to write inversions of the extended 
plane as inversions of C. For example, by Theorem 3, we can write the 
inversion of the extended plane with respect to the unit circle @ in the form 


1 
=, ifzeC— {Od}, 


t(z) = 


Problem 7 Write down each of the following inversions of the 
extended plane as a transformation of C: 


(a) the inversion with respect to the circle of radius 2 with centre the 
origin; 
(b) the inversion with respect to the extended real axis. 


The inversion of the extended plane that we asked you to write down in 
Problem 7, part (b), is particularly important because it provides us with a 
natural way of extending the conjugation function from C to C. 


Definition The function t : C > C defined by 


z wz2EeC, 


Ne) = 


CO, ih? =o, 


is called the extended conjugation function. 


This function occurs so frequently that we shall introduce a notation for the 
images of the points in its domain. Since we already have the notation z for 
the conjugate of a complex number z, we simply adopt the convention that 
= 00. 

Inversions are not the only transformations that can be extended to ¢. 
Indeed, most of the transformations discussed at the beginning of this section 
can be extended in a natural way to c 


5: Inversive Geometry 


Subsection 5.2.1 


Extending the Plane 


Definitions 
(a) The function ¢ : C > C defined by 


1 
= ifzeC—tor 


6S a eer ery 


O, tii'y = Cs, 


is called the extended reciprocal function. 
(b) A function t : C > C of the form 
az+b, ifzeC, 


t(z) = 
oo, ili Z = CS, 


where a, b € Canda £ 0, is called an extended linear function. 


The solution to the next example shows that the extended reciprocal function 


can be expressed as a composite of two inversions. 


Example 2 Find the composite t = f2 o tf; where 


t, is the inversion in the unit circle @, 
tz 1s the extended conjugation function. 


Solution Since 


1 
<, “#7 eC = {0}, 
z, ifzeC, 


=? 
Zs 


and fo(z) = ; 
ifz=o, 


t(z) = 00, feu S 


0, ifz=oa, 


we have 
t(0o) = ty 0 (00) = h(0) = 0, 


t(0) =t0t,(0) =h(w) =H. 


For the remaining values of z € C — {O} we have 


1 1 1 
H(z) =hot(z)=h (=) = (=) ==, 
z z z 


It follows that t = fy 0 ty is the extended reciprocal function. 
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Since scalings, rotations 
and translations are all 
special types of linear 
function, they too are 
extended to C by this 
definition. 


Next we describe how an extended linear function can be expressed as a 
composite of inversions. In preparation for this, we ask you to tackle the 


following problem. 
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Problem 8 Find the composite t = f2 0 t; where 


t, is the inversion with respect to the unit circle @, 
ty is the inversion with respect to the circle of radius 2 with centre 0. 


Give an interpretation of the composite function t. Also describe what 
would happen if the circle of radius 2 were replaced by a circle of radius 
/k with centre 0. 


Earlier, we noted that every linear function is a composite fos of a scaling s 
followed by an isometry ¢. Since every isometry ¢ is a composite of reflections, 
it follows that every linear function is a composite r, 0...072 07, 05 Of a 
scaling s followed by a number of reflections 71, r2,...,7n. 

Now the only difference between a linear function and an extended linear 
function is that the latter contains an additional point at infinity in its domain. 
Since this additional point maps to itself, it follows that an extended linear 
function is a composite of a scaling that also fixes oo, followed by a number 
of reflections that also fix oo. 

But a reflection that fixes oo is just an inversion in an extended line. Also, 
by the solution to Problem 8, a scaling that also fixes oo is a composite of 
two inversions. It follows that every extended linear function is a composite of 
inversions. 

Combining this with the result of Example 2 above we have the following 
theorem. 


Theorem 5 The extended reciprocal function and the extended linear 
functions can be decomposed into composites of inversions. 


Since inversions of the extended plane map generalized circles onto gener- 
alized circles, we have the following corollary of Theorem 5. 


Corollary The extended linear functions and the extended reciprocal 
function map generalized circles onto generalized circles. 


Example 3 Let ¢ be the extended linear function defined by 


2(-1+4 V3i)2+ 4-21), ifzeC, 


oO, ifz=o. 


t(z) = 


Express ¢ as a composite of inversions of the extended complex plane. 


Solution In addition to mapping oo to ov, the transformation ¢ scales the 
complex plane by the factor |2(—1 + /3i)| = 4, rotates it through the angle 
Arg (2(—1 + J3i)) = = , and then translates it through the vector (4, —2). 
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Extending the Plane 


The scaling by the factor 4 can be decomposed into the composite f2 0 ft, 
where 


t, is the inversion in the unit circle @, 
ty is the inversion in the circle of radius /4 = 2 centred at the origin. 


The rotation through the angle 27/3 can be decomposed into the composite 
t4 0 t3, where 


tz is the inversion in the extended real axis, 
t4 is the inversion in the extended line €4 U {co}, where £4 is the line 


y = V3x. 


(Here 4 is the line through the origin that makes an angle 27/3 with the x-axis.) 
The translation through the vector (4,—2) can be decomposed into the 
composite fg o ts, where 


ts is the inversion in the extended line 05 U {co}, where £5 is the line 


4x —2y =0, 
te is the inversion in the extended line 06 U {co}, where f6 is the line 
4x —2y = 10. 


(Note that 5 is the line through the origin that is perpendicular to the vector 
(4, —2), and £6 is the line that passes through (2, —1) parallel to és.) 
Since f, ts, t4, £3 and f2 o t) all map oo to itself, it follows that 


t=teotsotyotz 0M of. 


Problem 9 Let ¢ be the extended linear function defined by 


—9z+ (6-101), ifzeC, 


t(z) = . 
oO, if z= oo. 


Express ¢ as a composite of inversions of the extended complex plane. 
Hint: Use the decompositions described in the margin. 


5.2.4 The Riemann Sphere 


In the previous subsection, we introduced the point at infinity and the extended 
complex plane in order to provide a simplified explanation of the effect of 
inversion on lines and circles. However, it may seem unsatisfactory to have to 
visualize a straight line as a (generalized) circle, since the ‘ends’ of the line 
appear to be infinitely far apart. Also, it is difficult to visualize where the point 
at infinity should be placed relative to C, other than to think of it as smeared in 
some vague way around the ‘outer edge’ of C. 

Fortunately, there is a model of the extended (complex) plane in which the 
point at infinity appears as an actual point! Consider the complex plane C 
as lying in R? with the real and imaginary axes aligned along the x-axis and 
y-axis, respectively. Then each complex number x + iy may be represented by 
the point (x, y, 0) in the (x, y)-plane. 
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4 4x-2y=0 


The decompositions 
illustrated above 
generalize as follows. 


To scale C by a factor k: 
first invert in the circle 

{z : |z| = 1}, then invert in 
the circle {z : |z| = Vk}. 


To rotate C through an 
angle 0: first invert in the 
extended line {z : Arg z = 
O} U {oo}, then invert in 
the extended line 

{z: Arg z = 49} U {oo}. 


To translate C through a 
vector (a, b): first invert in 
the extended line 

(x,y) :ax+ by = 

O} U {oo}, then invert in 
the extended line 

{(x,y) :ax+ by= 

5 (a? + b*)} U {oo}. 
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North Pole NV 


Pin€ 
corresponds 
toP onS 


Next, draw the sphere S with centre at the origin and with radius 1; we call 
this the Riemann sphere. By analogy with the Earth, we refer to the point 
N = (0,0, 1) at the top of the sphere as the North Pole of S, and the point 
S = (0,0, —1) at the bottom of the sphere as the South Pole of S. 

Each line joining a point P in the complex plane to the North Pole intersects 
the Riemann sphere at some point P’, and vice versa. In this way, we obtain 
a one-one correspondence between all points P in the complex plane and all 
but one of the points P’ on the sphere. The one point on the sphere that cannot 
be associated with a point in the complex plane is the North Pole NV. 

As the point P’ on the sphere moves closer to N, the corresponding point 
P in the plane moves further away from the origin O. This suggests that we 
associate the North Pole N with the point oo in the extended complex plane. 

The function z : S > C, which maps points on the Riemann sphere to the 
associated points in the extended complex plane, is called stereographic pro- 
jection. Since z is one-one and onto, it follows that we can use the Riemann 
sphere as a convenient visualization of the extended complex plane ¢. 


In fact, we can find an explicit formula for stereographic projection of the 
point (X,Y, Z) onto the point z = x + iy, and vice versa. We use capital letters to 
denote coordinates of 
points in R3, and small 
Theorem 6 Let a denote the mapping of the Riemann sphere S onto C letters to denote points or 


given by stereographic projection. Then the stereographic projection of the coordinates in the 
complex plane. 


point (X,Y, Z) of S onto the point z = x + iy of Cis given by 


a =e 
WU yet a : 
7 aa 
Also, the inverse mapping is given by 
7 2x 2y x+y?-1 
1 . 
HE ey) = , . 7 
ee — x?+y2 +1 x27+y?241 


Proof Let the point P’(X, Y, Z) be a point on S (other than NV), and P(x, y) 
the point in the plane that corresponds to P’ under stereographic projection. 
Project the line NP’ P perpendicularly onto the (Y, Z)-plane, with P’ and P 
projecting onto the points Q’ and Q; then Q’ and Q have coordinates (0, Y, Z) 
and (0, y, 0) in R*. Draw the perpendicular Q’R from Q’ to the Z-axis. 
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Since RQ’ is parallel to OQ, the triangles ANRQ’ and ANOQ are similar. It 
follows that NR : RO’! = NO : OQ, or 


1-Z 1 


Y y 


a P(x, y) 


from which we obtain that 
¥ 


ee 
By projecting the line NP’P onto the (X, Z)-plane and using a similar 


argument, we can show that We omit the details. 
x 


x= —. 
1-Z 
It then follows that the mapping z is given by 


y 


X,Y,Z j - a= 

m(X,Y, Jext= TZ tit_z: 

Clearly this formula also holds when N is mapped to oo. For, here Z = 1. 
To find the formula for z~!, we use the fact that X2 + Y?2 + Z? = 1. 

Substituting the values of X and Y from the above formulas into the equation 

X? + Y? + Z* = 1 and doing some manipulation, we deduce that We omit the details. 


2 
2, 2 
1 =: ———, 5 
xe fy + i_Z (5) 


It follows from equation (5) and the earlier equations for X and Y that 


X=x-Z) = 
— = 
x24 y241 
and 
Y=y(1-z)=——” 

Also, we may rearrange equation (5) in the form 

_ xe + a —1 

~ x2 py24 10 


We can then combine these formulas to give the required formula for 1 ~!: 


m—'(x + iy) = (X,Y,Z) 


= 2x 2y ey? 1 
x24 y24 1 x24 y2? 41x? 4+y?241)° 


Notice that stereographic projection distorts the distances between points. 
For example, two points that are close together on S may project onto 
points that are close together in C, or onto points that are far from each 
other in C. 
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We can use the Riemann sphere to visualize extended lines as generalized 
circles in a very natural way, as follows. Consider a point P ona line ¢ in C. 


As P moves along @, the line that joins P to the North Pole sweeps out 
the plane through ¢ and N;; this intersects the sphere in a circle. Hence, as P 
moves along £, the corresponding point P’ on the sphere traces out a circle. 
As P moves further out along the line, the point P’ moves towards the North 
Pole. The point P’ never actually reaches the North Pole, since the North Pole 
corresponds to the ‘gap’ in the line @ that we mentioned earlier. 

We filled the ‘gap’ in @ by attaching oo to @ to obtain the extended line 
£U {oo} in C. On the sphere, the point N fills the corresponding gap in the 
circle, so that the extended line £ U {co} corresponds to an actual circle on the 
sphere — in fact, a circle through N. 

This fact is a particular instance of the following general result. 


Theorem 7 Under stereographic projection, circles on the Riemann 
sphere map onto generalized circles in C. 

In particular, circles on the sphere that pass through NV map onto extended 
lines in C, and circles on the sphere that do not pass through N map onto 
ordinary circles in ¢. 


Proof A circle on the sphere is the intersection of the sphere X* + Y* + 
Z? = 1 with some plane aX + bY+ cZ+d = 0, where a, b and c are not all 
zero. It follows from substituting the expressions 


= 2x 2y 


a) 
—1 
= ——.—,, Y = ———.—._._. and eo° 2 
x2+y24+1 x2+y2+1 


x24 y2 417 
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Subsection 5.2.3 


You met these equations 
in the proof of Theorem 6 
above. 
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for X, Y and Z into the equation X24. ¥?24 Z? = 1 that 
2ax + 2by +c (x? + y? - 1) 
x2+y24] 
This equation may be rewritten in the form 
2ax+ 2by +e(x? +9? - 1) td(x?+y?+ 1) = 0, 
or For its equation is that of a 


(c+ d)x? es (c+ d)y? + 2ax + 2by + (d—c) =0. circle or of an (extended) 


line. 


+d=0. 


It follows that the image of a circle on the sphere is a generalized circle in the 
(extended) plane. 

The circle on the sphere passes through N (0,0, 1) if the plane aX + bY + 
cZ +d = 0 passes through N, and so ifc +d =0. 

It follows that if the circle on the sphere passes through JN, its image in the 
extended plane has an equation of the form 


2ax + 2by + (d—c) = 0, 


and is an extended line. 
On the other hand, if the circle on the sphere does not pass through N, then 
c+d # 0 and its image is an ordinary circle in the plane. | 


Problem 10 Determine the images under stereographic projection 
onto C of the following circles on S. 


(a) The circle {(X,Y,Z): X7+¥?+Z? =1,X = 5} 
(b) The circle of intersection of S with the plane 3X + 2Y+Z=1 


As an illustration of Theorem 7, we describe what happens to the ‘lines of 
latitude and longitude’ of the Riemann sphere under stereographic projection. 

A ‘line of longitude’ on the Riemann sphere is a circle on S that passes 
through the North Pole and the South Pole of S; this projects onto a line 
through the origin in the plane. Similarly, a ‘line of latitude’ on the Riemann 
sphere is a circle at a constant height above the (x, y)-plane, so, by symmetry, 
this projects onto a circle centred at the origin in the plane. 


Just as the lines of latitude and longitude meet at right angles on the sphere, 
so do their projections. This is a consequence of the following remarkable 
result. 


294 


Theorem 8 Stereographic projection preserves the magnitude of angles. 


Proof First, we make the following crucial observation. Recall that if a point 
P moves along a line @ in the complex plane, then the corresponding point P’ 
on the Riemann sphere moves round a circle through N. Indeed, as P moves 
out ‘towards 00’, P’ approaches N; and the line NP’ (in R*) approaches the 
tangent at N to the circle on the sphere. It follows that the tangent at N to the 
circle on the sphere must be parallel to the original line ¢. 

Let C; and C2 be curves in the complex plane that intersect at some point 
P, and let €; and €2 be the tangents to C; and C2 at P. (Recall that we 
define the angle between C; and C2 at P to be the angle between ¢; and 
fy at P.) 

The curves on the Riemann sphere that correspond to ¢; and £2 are circles 
through the North Pole N and the point P’ on the sphere that corresponds to 
the point P in the plane. We have to show that the angle between these circles 
at P’ (that is, the angle between their tangents at P’) is equal to the angle 
between the lines @; and £2 at P. 

But, by symmetry, the angle between the circles at P’ is equal to the angle 
between the same circles when they meet again at N. And, as we saw above, 
the lines £1 and £2 through P are parallel to the corresponding tangent lines 
through N. Thus the angle between the circles at N must be equal to the 
original angle between €; and £2 at P. 

Hence, stereographic projection does preserve the magnitude of angles, as 
asserted. a 


Earlier, we showed that inversion preserves the magnitude of angles but 
reverses their orientation. Having introduced the Riemann sphere, we can now 
show that there is a sense in which this is true even for angles which have a 
vertex at the centre of inversion, or at oo. 

To see this, let £2; and £2 be two lines that intersect at the centre of inversion 
A. The corresponding extended lines £; U {00} and £2 U {oo} intersect again at 
the point oo. On the Riemann sphere these extended lines become circles C1 
and C> that intersect at the North Pole and at the point A’ corresponding to A. 


circle o 
inyersion 
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Recall that inversion has a 
similar property (see 
Theorem 5, 

Subsection 5.1.3). 


tangent to C 


Inversive Geometry 


Under the inversion, each extended line maps to itself, so C, and C2 also 
map to themselves. Notice, however, that the points A’ and N swap over, and 
so the angles at A’ and N swap over as well. By symmetry, these angles are 
equal in magnitude but opposite in direction. So, provided we interpret the 
angle between curves at oo as the corresponding angle on the Riemann sphere, 
we can conclude that the fact that inversion preserves the magnitude of angles 
holds throughout the whole of ¢. 


5.3.‘ Inversive Geometry 


5.3.1 Inversive Geometry 


We now come to the main purpose of this chapter, which is to introduce a 
geometry, known as inversive geometry, that we can use to study the properties 
of circles, lines and angles. 

Recall that according to Klein, a geometry has the following ingredients: 


a space consisting of a set of points; 
a group of transformations that act on the space. 


Each geometry is used to study those properties of figures in its space that are 
preserved by its transformations. For example, Euclidean geometry is used to 
study those properties of figures in R*, such as angle and distance, that are 
preserved by the isometries of R?. 

Since each isometry of R* can be decomposed into a composite of reflec- 
tions, we can think of the group associated with Euclidean geometry as the 
group of all possible composites of reflections. 

This provides the clue that we need to define the transformations of our 
new geometry. Rather than consider composites of reflections, we consider 
composites of inversions. 


Definition A transformation t : C — C is an inversive transformation 
if it can be expressed as a composite of inversions. 


For example, the extended reciprocal function is an inversive transformation 
because it can be expressed as a composite f2 0 f; where f; is the inversion in 
the unit circle and f2 is the inversion in the real axis. Similarly, all extended 
linear functions can be expressed as composites of inversions, so they too are 
inversive transformations. 


Theorem 1 The extended reciprocal function and the extended linear 
functions are inversive transformations. 


Since every inversion preserves the magnitude of angles and maps general- 
ized circles to generalized circles, the same must be true of all composites of 
inversions. We therefore have the following result. 
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Introduction to Chapter 2, 
before Subsection 2.1.1 


It is important to 
distinguish between 
inversions and inversive 
transformations. 
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Theorem 2 Inversive transformations preserve the magnitude of angles, 
and map generalized circles to generalized circles. 


Before we can use the inversive transformations to define a geometry, we 
must first check that they form a group. 


Theorem 3 The set of inversive transformations forms a group under the 
operation of composition of functions. 


Proof We check that the four group axioms hold. 


Gl CLOSURE Let r and s be inversive transformations. Then we can 
write 


r=f,0f0...0ok 
and 
S = ie41 Ofp420...0th, 
where f], f2,...,f, are inversions. Thus 
ros =(t,0t20...0ff) 0 (th41 0 fh420...0%) 


is a composite of inversions, and is therefore an inversive 
transformation. 

G2 IDENTITY The identity for composition of functions is the identity 
transformation given by 


t(z=z (ze©). 


This is an inversive transformation since t = sos, where 
5 is the inversion in the unit circle. 
G3 INVERSES If ¢ is an inversive transformation, then we can write 


1 0196.05.60 Ths 


where f1,f2,...t, are inversions. It follows that t has 
inverse 


6fo..0% Shetere cot, 


which is an inversive transformation. 
G4 ASSOCIATIVITY Composition of functions is always associative. 


Since all four group properties hold, it follows that the set of inversive 
transformations forms a group under composition of functions. a 


Having shown that the inversive transformations form a group, we can use 
them to define a geometry. But what space should we use for the geometry? 
Since the inversive transformations act on c the space will have to be C. But 
what do figures look like in this space? Just as figures in IR? are defined to 
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Here we could have 
chosen any other inversion 
for s since all inversions 
are self-inverse. 


Inversive Geometry 


be subsets of IR’, so figures in € are defined to be subsets of C. Thus circles 
and extended lines are both examples of figures in C. We can now make the 
following definition. 


Definition Inversive geometry is the study of those properties of figures 
in C that are preserved by inversive transformations. 


We have already met the following inversive properties: generalized circles, 
magnitude of angles, and tangency (since zero angles are preserved). 


5.3.2 Relationship with Other Geometries 


Although the point at infinity is crucial to the theory of inversive geometry, 
when we come to interpret results about figures we often choose to confine our 
attention to points in C. By ignoring the point oo in this way, the remaining 
points in € can be interpreted as points of R* and any extended lines in the 
figure can be interpreted as ordinary lines. 

In this sense figures in R? can be interpreted as figures in C, and vice versa. 
For example, the figure in the margin is a figure in R?. It consists of a family of 
circles centred at the origin, and a family of lines that intersect at the origin. If 
we add the point oo to the figure, then all the lines become generalized circles 
and the figure becomes a figure in ¢. 

Earlier, you saw that every Euclidean transformation can be expressed as a 
composite of reflections. Since a reflection can be interpreted as an inversion 
that fixes oo, it follows that we can interpret every Euclidean transformation as 
a composite of inversions, and hence as an inversive transformation. 

An immediate consequence of the above observations is that when figures 
in C are interpreted as figures in R?, all their inversive properties become 
Euclidean properties. This is because any property that is preserved by all 
inversive transformations must also be preserved by all Euclidean transfor- 
mations. For example, the magnitude of angles is both an inversive and a 
Euclidean property. 

A pictorial representation of the relationship between inversive and 
Euclidean geometry is given in the margin. Provided that we ignore the point 
oo, the group of Euclidean transformations can be regarded as a subgroup of 
the group of inversive transformations. Because the Euclidean group is smaller 
than the inversive group, it follows that Euclidean geometry has more proper- 
ties than does inversive geometry. For example, length is a Euclidean property 
but it is not an inversive property. 

How does affine geometry fit into the scheme? In Subsection 2.2.1 we 
showed that every Euclidean transformation is an affine transformation. But 
what is the relationship between affine transformations and inversive transfor- 
mations? 

Certainly, some affine transformations are inversive without being 
Euclidean. For example, earlier you saw that the ‘doubling map’, 


t(z)=2z (ze), 
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We could make this 
interpretation precise by 
adopting the convention 
that a Euclidean 
transformation maps oo to 
oo when it acts on €. 


Inversive group 


Affine group 


See Subsection 2.2.3 
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is an affine transformation. This transformation is not a Euclidean transfor- 
mation, and yet it can be decomposed into the composite f2 o fj, where fy 
is the inversion in the unit circle @, and f is the inversion in the circle 
{IzZ] = J2: ze C}. So, with the usual proviso about oo, ¢ is an inversive 
transformation. 

However, not all affine transformations are inversive transformations. For 
example, the transformation 


t(x) = & 1s (xe R’) 


represents a horizontal shear. This cannot be an inversive transformation, since 
it does not preserve angles. 

It follows that the affine group of transformations contains the Euclidean 
group but overlaps the inversive group, as illustrated in the margin. 


5.3.3 Mobius Transformations 


To enable us to tackle problems in inversive geometry algebraically, we need 
an algebraic representation of the inversive transformations. 

We shall show that each inversive transformation has either the form 
t(z) = M(z), or the form t(z) = M(z), where M is a so-called Mobius 
transformation. 


Definition A Mobius transformation is a function M : C > C of the 
form 
az+b 


WA cade @ 


where a, b, c,d € C and ad — bc £0. 
If c = 0, then we adopt the convention that M(oo) = oo; otherwise, we 
adopt the convention that M(—d/c) = oo and M(w) = a/c. 


Remarks 


1. Every (extended) linear function is a Mébius transformation, as can be seen 
by setting c = 0, d = 1. Also, the (extended) reciprocal function is a 
Mobius transformation witha = d =Oandb=c=1. 

2. Ifc = 0, then the formula for M reduces to M(z) = (a/d)z + (b/d). This 
defines an extended linear function, because the condition ad — bc # 0 
ensures that both a and d are non-zero, and so a/d is also non-zero. Also, 
the convention that M(oo) = oo complies with the definition of an extended 
linear function. 

3. The condition ad— bc ¥ 0 is equivalent to the statement that the ratios a : c 
and b : d are unequal. This is necessary to ensure that the numerator is not 
simply a multiple of the denominator, for if it were, M would be a constant 
function. 
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You saw how a scaling 
can be decomposed into a 
composite of inversions in 


Problem 8, 


Subsection 5.2.3. 


For example, 


Inversive group 


Affine group 


6i x 2—4i x 3 =0, 


so that 
6iz+ 4i _ 
3z+2 


2i, a constant. 
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Problem 1 Which of the following formulas define a Mobius trans- 
formation? 


—z7+2i 
; 3z —4i 
(c) M(z) = —3z + : (d) M(z) = 1+ 


5 
(a) MZ) = . (b) M(z) = 


Z+2i 


For those formulas that do define a Mobius transformation, state the 
image of oo under M. 


Before we look at the properties of Mébius transformations, we first verify 
that a Mobius transformation is indeed an inversive transformation. 


Theorem 4 Every Mobius transformation is an inversive transformation. 


Proof Let M be the Mobius transformation defined by the formula 


If c = 0, then M is an extended linear function, and is therefore an inversive 
transformation. 
If c £0, then for z € C — {—d/c} we can write 


—ad+be+a(cz+ d) 
c(cz+ d) 


(“—*) ( 1 ) a 
= : 1 nea 
Cc cz+d Cc 


It follows that M may be expressed as the composite £3 0 f2 0 ft}, where f2 is 
the extended reciprocal function, and ft; and f3 are the extended linear functions 


M(z) = 


cz+d, ifz#4nm, 
h(z) = 


oO, ifz=o, 
and 


oO, ifz=o. 


—((ad — bc)/c)z+ (a/c), ifz#o, 
R(z) = 


Next, we check that the transformations #3 0 f2 0 fj and M agree also at the 
exceptional points oo and —d/c, as follows: 


13 0 12 0 11(00) = 13 0 f2(&O) = (0) = a/c = M(oo); 
t3 0 f2 of} (—d/c) = Bo (0) = B(C&) = ~ = M(—d/c). 


Since the extended reciprocal function and the extended linear functions are 
inversive transformations, it follows that M is an inversive transformation. Hi 
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Since M6bius transformations are inversive transformations, they must pre- 
serve the magnitude of angles and map generalized circles to generalized 
circles. In fact, we can say slightly more than this, for in the proof of Theorem 4 
we showed that every Mobius transformation is either an extended linear func- 
tion or a composite of two extended linear functions and an extended reciprocal 
function. Since the extended linear functions and the extended reciprocal func- 
tion preserve both the magnitude and orientation of angles, the same must be 
true of Mobius transformations. 


Theorem 5 Mo6bius transformations preserve the magnitude and orienta- 
tion of angles, and map generalized circles onto generalized circles. 


5.3.4 Matrix Representation of Mobius Transformations 


In order to explore some of the other properties of Mobius transformations, it 
is helpful to establish a correspondence between Mobius transformations and 
matrices. 

Recall that, if a, b, c, d, e, f, g, h € C, then 2 x 2 matrices have the 
following properties: 


[ i) ( ‘= ae+bg af+bh 
c d g h ~ ce+dg cf+dh ; 


—1 
a b 1 d -—b : 
( A =o (4 ae if ad — bc £0. 


Here, the condition ad — bc # 0 for the matrix to be invertible is reminiscent 
of the condition ad — bc ¥ 0 that appears in the definition of a Mobius trans- 
formation. This suggests the following fruitful connection between Mébius 
transformations and 2 x 2 invertible matrices. 


and 


Definition Let M be a Mobius transformation defined by 


: () 


where a, b, c,d € C. Then 


is a matrix associated with M. 


Remark 
: a b ; : roe . 
1. Every matrix ( ‘) associated with a Mobius transformation 
Cc 


M(z) = &% is invertible because ad — bc # 0. 


cz+d 
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2. A matrix associated with a Mobius transformation M is not unique, because 
we can multiply the numerator and denominator of the formula (1) by the 
same non-zero constant without altering the transformation. For example, 
both ; : 

due and M(z)= sides : 

3z 4+ 2i 3iz— 2 
specify the same Mobius transformation M, and so both the matrices 


2 i id 2i —-l 
3 2i 3i —2 
are associated with M. 
In general, if A is a matrix associated with a Mobius transformation M, 


then for any non-zero c € C, cA is also a matrix associated with M. In fact, 
every matrix associated with M has the form cA for some c € C — {0}. 


M(z) = 


Example 1 Decide which, if any, of the matrices 


0 4 8 0 -4i 0 
A= (j a a= (5, a a= ({ ) 


are associated with each of the following Moébius transformations M. 


(a) MQ = ou@=— @jmM@=—” 


3i Z z—4i 


Solution 


(a) Every matrix associated with this M is a non-zero multiple of the matrix 


—3i 2 
1 -3i)° 


Hence none of the three given matrices is associated with M. 
(b) Every matrix associated with this M is a non-zero multiple of the matrix 


0 —4i 
Oo) 


Since A; = iA, it follows that A, is a matrix associated with M. 
(c) Every matrix associated with this M is a non-zero multiple of the matrix 


4i OO 
a=(4 a 


Since Az = —2iA, it follows that A» is a matrix associated with M. 


Problem 2 Decide which, if any, of the matrices 


0 1 0 2 210 
a= (4, i) a= (f a a= (7 _ 
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are associated with each of the following Mobius transformations M. 


Vv _ 2i B) A _ 2i Mt _ e+e 
(a) 1@) = Fa 5 (b) 2(z) = — (c) 3) = 7 


A particularly important transformation of C€ is the identity function 


defined by 
t(zy=z (zeO). 


This is a Mobius transformation because it can be written in the form 


_, +0 
~ Oz+1° 


t(z) 


A matrix associated with this Mobius transformation is the identity matrix 


t=(i, 1); 


5.3.5 Composing Mobius Transformations 


The next example illustrates what happens when two Mobius transformations 
are composed. 


Example 2 Determine the composite M; 0 Mz, where M, and Mp are the 
Mobius transformations defined by 
iz+1 zti 


d Mo(z) = 
gag 1 ei 


M\(z) = 


Solution Since M, and M2 are one-one mappings of € onto C, the same 
must be true of M; 0 M>. A formula for M; 0 Mo(z) is 


zt+i 
M, o Mo(z) = M, 
2z-—1 


i (#4) ae 


_ $6404 C2=—) 

~ 2(¢+i)-222-1) 
OA jg=2 

= we +20. 


We can check that this formula defines a Mobius transformation by noting that 


(2+ i)(2 + 2i) — (—2)(—2) = -2+ 61 #0. 
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Strictly speaking, we 
should check our 
convention for the 
exceptional points 
separately. For example, 
by convention 

M2(co) = i so that 


Mj 0 My(0o) = My (5) 


This agrees with the value 
obtained when we apply 
our convention to the 
formula for Mj o M>. 
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Thus Mj o M2 is a Mobius transformation given by 


(2+i)z—2 
oe TROY 


M, o M2(z) = 


Problem 3 Write down matrices A; and Ao associated with the 
Mobius transformations M, and M2 defined in Example 2. Calculate the 
product A; Az, and compare it with the Mébius transformation M; 0 M2. 


In Example 2, the composite of two Mobius transformations M; and M) 
turned out to be another Mobius transformation. The solution to Problem 3 
demonstrates that the product of matrices associated with M, and M) is a 
matrix associated with M; o M). The following theorem confirms that this 
is always the case. 


Theorem 6 Composition of Mébius Transformations 

Let M; and M> be Mobius transformations with associated matrices Aj 
and A», respectively. Then M, o M2 is a Mobius transformation with an 
associated matrix Aj Ao. 


Proof Let M; and M2 be defined by 


b 
“ and Mo(z) = eee s, 


M\(z) = : 
it) cz+d ezth 


Since M, and M2 are one-one mappings of € onto C, the same must be true 
of M, o M2. We can obtain a formula for M, o M2(z) as follows: 


ec+ f 
My © Matz) = Mi ( ) 
gzth 
ect f 
_ ¢ (=) +b Strictly speaking, we 
_ ect f d should check the 
7 (Si ) + exceptional cases 
a(ez+ fy) +b(gz+h) separately. However, we 


= shall omit this checking 
c(ez+ f) +d (gz +h) from here onwards. 


__ (ae + bg) + (af + bh) 
~ (ce +dg)z+ (cf + dh)’ 


Since M, o M2 is one-one, this formula cannot remain constant as z varies, so 
(ae+bg)(cf+dh) — (af+bh)(ce+dg) € 0. The formula must therefore define 
a Mobius transformation. 
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A matrix associated with M, o M2 is 
ae+bg af+bh 
ce+dg cftdh)’ 


which is the product of the matrices 


Ga) CS) 
c d g h 
associated with M, and Mp, respectively. 0 


Theorem 6 enables us to calculate the composite of two Mobius transforma- 
tions by using the following strategy. 


Strategy To compose two Mébius transformations M; and M2: 


1. write down matrices A; and Ag associated with M, and Mo; 

2. calculate A; Ag; 

3. write down the Mobius transformation M, o M2 with which AjAo is 
associated. 


Example3 Use the strategy to determine the composite M, 0M? of the Mobius 
transformations 


3z+1 2iz + 3 
M\(z) =—— and Mo(z)= : 
iz—2 Zz 


=, 


Solution The Mobius transformations M, and M2 have associated matrices 


3 1 2i 3 
a= (j - and m= (4 2) 


respectively. It follows that a matrix associated with M, o M2 is 


3 1 2i 3 1+ 6i 7 
Aida =(j Be 3)={ —4 aaa) 
The composite M; o Mp is therefore the Mobius transformation defined by 


(14+ 6i)z +7 
—47 + (4+ 3i)' 


M, o M2(z) = 


Problem 4 Let M; and M2 be the Mobius transformations defined by 
3z+1 2iz + 3 
M\(z) =—— > and M(z) = ——. 
iz—2 z—2 


Use the strategy to determine each of the following composites. 


(a) M20 M, (b) Mj o M, 
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Problem 5 Let M,; and M2 be the Mobius transformations defined by 


z—i 2z+i 
M\(z) = ara) and Mo(z)= a 


Use the strategy to determine the composite Mj o Mp. 


5.3.6 Inverting Mobius Transformations 


In the solution to Problem 5 you saw that the composite Mo M2 of the Mobius 
transformations 


z—-i 2z2+i 
Me) 5 and MON 


is the identity function on C. This shows that M) is the inverse function of 
M,. In terms of matrices, this is equivalent to saying that there are matri- 
ces associated with M, and M2 whose product is the identity matrix I. For 
example, 


We can use this idea to find the inverse of any given Mobius transformation 
M. The inverse function M~! certainly exists, since M is a one-one transfor- 
mation from € onto C. To find the inverse, let A be a matrix associated with 
M. Since A is invertible, 


AA“! =I=A"!A, 


so the Mébius transformation associated with the matrix A~! must be the 
inverse function M—!. 


Now, if M(z) = a+b we can take A = (< 4 , so that 
Cc 


cz+d’ 


aia_i (4 -6 
~ ad—be\-c a)’ 


But any non-zero multiple of A~! is also a matrix associated with M~!, so we 


shall usually use the matrix ( 
—c a 


) as a matrix for M~!. Then 


We summarize the result of this discussion in the following theorem. 
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Theorem 7 Inverse of a Mobius Transformation 
The inverse of the Mobius transformation 


az+b 
M(z) = 
(2) Gear al 
is also a MObius transformation, and it may be written in the form 
= Oe = Io 
M~!(z) = ———. 
C5 @! 


For example, the inverse of the Mobius transformation 
iz+1 
2z—2 


M(z) = 


is given by 


—2z-1 2z+1 
M'(zZ) = - 
@) —2z+i 2z—-i 


Problem 6 Determine the inverse of each of the following Mébius 
transformations. 


(a) My(z) = 22) Maz) = = (©) Mg) = 2% 


5.3.7. The Inversive Group 


We now have all the information we need to prove the following theorem. 


Theorem 8 The set of all MGbius transformations forms a group under 
composition of functions. 


Proof We show that the four group axioms hold. 


Gl CLOSURE By Theorem 6, the composite of two Mobius transforma- 
tions is itself a M6bius transformation. 
G2 IDENTITY The identity is the Mobius transformation given by 
— i1z+0 
M(z) = Oz+1° 
G3 INVERSES By Theorem 7, every Mobius transformation has an 
inverse. 


G4 ASSOCIATIVITY Composition of functions is always associative. 


It follows that the set of all Mébius transformations forms a group under 
composition of functions. | 


Having shown that the set of Mébius transformations forms a group, we 
now investigate its relationship with the group of all inversive transformations. 
We know from Theorem 4 that every MOébius transformation is an inversive 
transformation, but is every inversive transformation a Mébius transforma- 
tion? Clearly, the answer is no. For example, an inversion cannot be a Mébius 
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transformation since it reverses the orientation of angles, whereas all Mébius 
transformations preserve the orientation of angles. 

Although inversions are not Mobius transformations, there is a close 
connection between inversions and MG6bius transformations. 


Theorem 9 Every inversion ¢ has the form t(z) = M(z), where M isa 
M6bius transformation. 


Proof. If f is an inversion of C in an extended line, then by Theorem | of 
Section 5.2 it must have the form ¢(z) = az + b, with t(co) = on. It follows 
that t(z) = M(z), where M is the Mobius transformation 


az+b 
NS at 


On the other hand, if f is an inversion of c in acircle of radius r with centre c, 
then by Theorem 3 of Section 5.2, 


2 2 ss > 2 = 
r r -c(z—€ cz (r* =-ce 
eax dpe eK: _ f : ) 
z—cC Z—-C Z—-C 
So once again ¢ has the form t(z) = M(z), where M is the Mébius 
transformation 
cz + (r?2 —cé 
M(z) = eee) B 


We can now show that every inversive transformation ¢ has the form f(z) = 
M(z) or t(z) = M(zZ), where M is a Mobius transformation. 


Theorem 10 Every inversive transformation t can be represented in ¢ by 
one of the formulas 


where a, b, c,d € Cand ad — bc £ 0. 


Proof We first show that the composite of two inversions f; and fz is a Mébius 
transformation. By Theorem 9 above, we can write t)(z) = M,(z) and f(z) = 
M>(z), where M, and M> are Mobius transformations. Thus 


ty 0 12(z) = t(M2(z)) = Mi (M2(z)). 


But if 
az+b 


mt) = td 
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It cannot have the form 


t(z) = az+b, since t 


reverses the orientation of 


angles. 


Notice that 


c+ (—@) — (r? —cé) +1 


r? £0. 
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then we can define a Mobius transformation M3 by 


ae az+b 

M3(z) = M2(z) = = = =. 

32) = M2(@) (=) ézt+d 
Since ty 0 fo(z) = M,(M2(zZ)) = M,(M3(z)), it follows that t) o fy 
is a composite of two Mébius transformations, and is therefore a Mdébius 


transformation. 
Next, let ¢ be an arbitrary inversive transformation, and write 


t=f,0%0...0f,, 


where f],f2,...,f, are inversions. If m is even, then we can rewrite ft as a 
composite of M6bius transformations by pairing together the inversions in the 
form 


t = (t, 0f2) 0 (20 t4) 0... 0 (ty_] Of). 


It follows that t is a Mébius transformation, M say, so we can write t(z) = 
M(z). 

To deal with the case where n is odd, let r be the extended conjugation 
function. Since r is its own inverse, we can write 


t=(tjot20...0t,or)or. 


Here the composite in the bracket involves an even number of inversions, so 
it must be a MO6bius transformation, M say. Hence t(z) = M or(z) = M(z). 
|_| 


Theorem 10 provides us with an insight into the structure of the group G 
of all inversive transformations. Those inversive transformations that can be 
written in the form 

im az+b 

cz +d 
are Mobius transformations, and we know from Theorem 8 that they form 
a subgroup of G. We refer to these transformations as direct inversive 
transformations because they preserve the orientation of angles. 

The remaining inversive transformations are of the form 


az+b 
cz+d 
These are Mobius transformations composed with the extended conjugation 


function, and so they reverse the orientation of angles. For this reason we shall 
refer to them as indirect inversive transformations. 


t(z) = 


5.3.8 Images of Generalized Circles 


We have shown that an inversive transformation maps generalized circles 
onto generalized circles. However, apart from the special case of inversion 
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in the unit circle, we have not yet given a strategy for finding the image of a 
generalized circle under an inversive transformation. 

One such strategy is based on the fact that every generalized circle in C is 
uniquely determined by any three points lying on it. Indeed, if the three points 
lie in C, and are non-collinear, then they determine a unique circle in C. On 
the other hand, if the three points are collinear, or if one of the points is oo, 
then they determine a unique (extended) line. 


Strategy To determine the image of a generalized circle C under an 
inversive transformation f: 


1. write down three points z1, z2, z3 on C; 
2. determine the images f(z1), t(z2), t(z3); 
3. the image t(C) is the (unique) generalized circle through ft(z1), f(z2), 


t(z3). 


Example 4 Use the strategy to find the image of the unit circle @ under the 
inversive transformation defined by 


Solution We first pick three distinct points on the unit circle. There is no 
definite rule about which points should be chosen, so to keep the calculations 
simple, we pick the points 1, i and —1, as shown below. Now 


ii 


t(1) =o, t(i) = = i =0 and 
-I+i 1 ; 
t( 2 a ee Lye 


So the image of @ is the generalized circle through the points oo, 0 and 


5(1 — i). This is an extended line through the origin with slope —1. 


= D ON da-a 


bts) 
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If you can pick a point 
that maps to oo then you 
know immediately that the 
image is an extended line. 
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Problem 7 Let ¢ be the inversive transformation defined by 


z—2i 
e2> 


t(z) = 


Use the strategy to determine the image of each of the following 
generalized circles under f: 


(a) the extended line £U{oo}, where £ is the line with equation x+y = 2; 
(b) the circle with equation (x + 1)? + y* = 1. 


5.4 Fundamental Theorem of Inversive Geometry 


5.4.1 Comparison with Affine Geometry 


In Subsection 2.3.2 we introduced the Fundamental Theorem of Affine Geom- 
etry, which states that, given any three non-collinear points in the plane, there 
is always an affine transformation that maps the points to another three given 
non-collinear points. Since a triangle is uniquely determined by its three (non- 
collinear) vertices, and since affine transformations map triangles to triangles, 
we were able to use the Fundamental Theorem of Affine Geometry to show 
that all triangles are affine-congruent. 

In this section we prove an analogous Fundamental Theorem of Inversive 
Geometry which states that given any three points in the extended complex 
plane C there is always an inversive transformation (in fact a Mobius transfor- 
mation) that maps them to another three given points in the extended complex 
plane. Since a generalized circle is uniquely determined by any three of its 
points, and since inversive transformations map generalized circles to general- 
ized circles, we can show that all generalized circles are inversive-congruent. 
That is, given any two generalized circles C, and C2 in the extended plane, 
it is always possible to find an inversive transformation (in fact a Mébius 
transformation) that maps C; onto C2. 


5.4.2 Mapping Three Points to Three Points 


The proof of the Fundamental Theorem of Inverse Geometry is very similar to 
the proof of the Fundamental Theorem of Affine Geometry which we gave in 
Subsection 2.3.2. There we described how to construct an affine transformation 
which maps one given set of three (non-collinear) points in R? onto another. 
We did this by forming a composite of two affine transformations which map 
the points via the auxiliary points (0, 0), (1, 0) and (0, 1). 

A similar idea works for the Fundamental Theorem of Inversive Geometry. 
In this geometry the space is € rather than R2, and we map via the auxiliary 
points 0, 1 and oo. 

So let us start by considering how to construct a Mébius transformation that 
maps three given points z;, Z2, z3 in € to the points 0, 1, oo, respectively. 
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Here x + iy = z, as usual. 


affine 


AS 


inversive 


one 
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First observe that if z;, z2, z3 belong to C, then any Mobius transformation 


of the form 
Z—Z 
M(z) = K-— 


£— £3 


maps Zz; to 0 and z3 to oo. We can therefore obtain the required Mobius trans- 
formation by choosing K so that M(z2) = 1. The following example illustrates 
how this is done. It also shows how the method can be modified to deal with 
cases where one of z1, Z2, Z3 iS CO. 


Example 1 For each set of three points given below, determine a MGbius 
transformation M that maps the points to 0, 1, oo, respectively. 


(a) 3,-1.3  (b)0%0,1,2 =©i,~,3 5,2, 


Solution 


(a) To ensure that M (+) = 0 and M(3) = 00 we let M have the form 


a1 
2 
ae 


for some complex number K. Since M(—1) = 1, we must have 


Mea Ke 
Zz 


so that K = g The required Mobius transformation is therefore 


re ane 
a ~ 3z-9° 


(b) To ensure that M(oo) = 0 and M(2) = oo we let M have the form 


M(z) = 


z-2 
Since M(i) = 1, we must have 
K 
l=—,\, 
i-2 
so that K = i — 2. It follows that 
M( = i-2 
z= — 


(c) To ensure that M(i) = 0 and M(3) = co we let M have the form 

z- 

7-3 

In this case there is no need to include a constant K because M(oo) is 
already equal to 1. 


M(z) = 
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Here K can be any 
complex number. 


This has the form 
az+b 
M = =. 
(z) ae) 


c= 1landd = -—3. 


This has the form 
az+b 
M = 
(z) FonTe 


witha =0,b= K,c=1 
and d = —2. Since c # 0, 
it follows from the 
definition of a Mobius 
transformation that 

M (oo) = a/c = 0, as 
required. 


This has the form 
_— aztb 
M(z) = Cz+d 


witha = 1,b=—i, 

c = 1 andd = —3. Since 
c #0, we have 

M(co) = a/c = 1, as 
required. 
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(d) Here we require M(5) = 0 and M(oo) = ~, so M is an extended linear 
function of the form 


M(z) = K(z—5). This has the form 
M(z) = 44 
Since M(2) = 1, we must have me 


witha = K,b=—5K, 
1= K(2—-5), c = Oandd = 1. Since 
c = 0, it follows from the 


so that K = — i It follows that definition of a Mobius 
7 transformation that 
M(z) =—4(z—5). M(o0) = 00. 


Guided by the above example we can summarize all the possible cases to 
obtain the following general strategy. 


Strategy To determine the MGbius transformation M which maps three 
given points z1, Z2, z3 onto the points 0, 1, oo respectively: 


1. choose the appropriate form of mapping from the following formulas 


for M: 

e for z1, 22,73 0,1,c0 use M(z)= Ka 
e for oo, z2,z3 +> 0,1,00 use M(z)= — 
e for zj,00,z3+-> 0,1,00 use M(z) = =; 


e for z1,zZ2,00 > 0,1,c0 use M(z)= K(z—2z1); 
2. find the complex number K for which M(z2) = 1. 


Problem 1 For each set of three points given below, determine a 
Mobius transformation M that maps the points to 0, 1, oo, respectively. 


(a) —1,-3,0 (b)#,2,1 (c)o0,-3,2  (d) 3,2, 00 
We are now in a position to prove the main theorem of this section. 


Theorem 1 The Fundamental Theorem of Inversive Geometry 

Let z1, Z2, z3 and w1, w2, w3 be two sets of three points in the extended 
complex plane C. Then there is a unique Mobius transformation M which 
maps Zz; to w1, Z2 to w2, and z3 to w3. 


Proof According to the above strategy there is a Mobius transformation M, 
which maps the points z;, z2, z3 to the points 0, 1, 00, respectively. Similarly, 
there is a Mobius transformation Mz which maps the points w 1, w2, w3 to the 
points 0, 1, oo, respectively. 


Fundamental Theorem of Inversive Geometry 


M,'oM, 
a 


\@> A 


The composite M = M, ' 4 M, is therefore the required Mobius transfor- 
mation which maps z1, Z2, z3 to the points w1, w2, w3, respectively. 

To check uniqueness we first observe that the identity is the only Mébius 
transformation which maps each of the points 0, 1, oo to themselves. Indeed, if 


az+b 


M = 
@) cz+d 


is a Mébius transformation which maps oo to itself, then c = 0. Also, if M 
maps 0 to itself, then M(0) = b/d = 0, so b = O. It follows that M(z) = 
(a/d)z. But if M maps | to itself, then M(1) = a/d = 1, which implies that 
a = d. Putting all this together we conclude that M is the identity M(z) = z, 
as required. 

Next suppose that M and M’ are two Mobius transformations which satisfy 
the conditions of the theorem. 


Mand M' 


wr E> A 


Then the composites Mz 0 Mo M;! and Mz 0 M'o M;! must both be Mobius 
transformations which map each of the points 0, 1, oo to themselves. Since 
this implies that both composites are equal to the identity, we can write 


M2.0MoM,!=M)0M'oM,". 


If we now compose both sides of this equation with M, ' on the left and 
with M, on the right, then we obtain M = M’, as required. | 
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If we actually need to find the Mobius transformation which maps one set 
of three points onto another set of three points we simply follow the strategy 
used to prove part (a) of the Fundamental Theorem. 


Strategy To determine the Moébius transformation M which maps the 
points Z1, 22, z3 to the points w), w2, w3, respectively: 


1. find the Mobius transformation M, which maps the points z1, z2, z3 to 
the points 0, 1, 00, respectively; 

2. find the M6bius transformation M2 which maps the points w1, w2, w3 
to the points 0, 1, oo, respectively; 

3. calculate M = M;' o My. 


The following example illustrates how to implement this strategy. 


Example 2 Find the Mébius transformation M which maps the points i, 00, 3 
to the points 7 —1, 3, respectively. 


Solution We follow the steps in the above strategy. 


1. From part (c) of Example 1, we know that the Mobius transformation M, 
which maps the points 7, 00, 3 to the points 0, 1, oo, respectively, is given by 


zZ-1 
M — : 
1(z) pas 


2. Also, from part (a) of Example 1, we know that the Mo6bius transformation 
Mp) which maps the points 7 —1, 3 to the points 0, 1, oo, respectively, is 
given by 

8z—4 

32-9 


3. Matrices associated with M, and M2 are 


M>(z) = 


also, by Theorem 7 of Subsection 5.3.6, a matrix associated with the inverse 


of M2 is 
-9 4 
—3 8)° 


Hence, a matrix associated with M = M, 'oM is given by 


—9 4\/1 -i\  (-5 -12+9i 
—3 s/\1 -3/  \ 5 -2443i)' 
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The required Mobius transformation is therefore As acheck, you can verify 
=—57— 124 9) that M(i) = ,, 
= = ar M(co) = —1 and 
= M(3) =3. 
Problem 2 


(a) Find the M6bius transformation which maps the points —1, i, | to 
the points —1, —3, 0, respectively. 
(b) Find the Mobius transformation which maps the points 3, 00, —2 to 


the points 3, i, 1, respectively. 


5.4.3 Circles through Four Points 


We now explore an application of the Fundamental Theorem which enables us 
to determine whether four given points z1, z2, z3 and z4 lie on some generalized 
circle. 

If there is a generalized circle that passes through z1, z2, z3 and z4, then 
it must be the unique generalized circle C that passes through z;, z2 and z3. 
We have to decide whether C also passes through z4. To do this, consider the 
Mobius transformation M that maps Zz}, Z2, z3 to 0, 1, oo, respectively. Under 
M the image of C is the extended real axis, for this is the only generalized 
circle that passes through 0, | and oo. If z4 lies on C, then its image under M 
must lie on the real axis; whereas if z4 does not lie on C, then its image under 
M cannot lie on the real axis. 


Example 3 Determine whether the four points i, 1 + 47, 3, 4+ 3i lie ona 
generalized circle. 


Solution First, we determine the Mébius transformation M that maps i, 
1+ 4i, 3 to 0, 1, oo, respectively. Following the strategy in the previous 
subsection, we observe that this transformation must be of the form 


z-1 
M(z)=K 
z-3 


for some complex number K. Since M(1 + 47) = 1, we must have 


_ d+4i) =i 143i 


(+4i)-3 —244i’ 


You met this method of 


eo simplifying a quotient by 
| 244i = (=2 + 41) ~ 31) me Pe multiplying both 
14+3i 10 numerator and 


denominator by the 

conjugate of the 
(z—i)1 +2) denominator earlier, in 

M(z) = 3 . Subsection 5.2.1. 


Thus the transformation M is given by 
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It follows that 

(44+ 21/)(1+i) 246i 
i-s 143: 

Since this is real, it follows that 7, 1 + 47, 3, 4+ 37 do all lie on a generalized 

circle. 


M(4+4+3i) = 


In this example we showed that the points 7, 1 + 47, 3, 4 + 37 all lie on 
a generalized circle; but suppose we want to show that the points lie on an 
ordinary circle. To do this we need to check that the generalized circle that 
passes through the points is not an extended line. If it were an extended line, 
then it would pass through oo, and so M(oo) would be real. Since M(oo) = 
1 +7 is not real, it follows that the points do indeed lie on an ordinary circle. 


Strategy To determine whether z, z2, z3 and zz, lie on a circle: 


1. find the MG6bius transformation M which maps 2Z1, z2, z3 to 0, 1, 00, 


respectively; 

2. the points z, Z2, 3, z4, lie on a generalized circle if and only if M(z4) 
is real; 

3. the generalized circle in Step 2 is a circle provided that M(oo) is not 
real. 


Problem 3 Determine whether each of the following sets of four 
points lies on a circle. 


(a) 0, —4, —2i, —1 — 3i (b) =1, =1,4,2—1 


5.4.4 Congruence of Generalized Circles 


We end this section by stating an important consequence of the Fundamental 
Theorem of Inversive Geometry. 


Theorem 2 Let C; and C> be generalized circles in the extended complex 
plane. Then there is a Mobius transformation that maps C; onto C2. 


Proof Leta, b, c be any three points on C, and let d, e, f be any three points 
on C2. By the Fundamental Theorem of Inversive Geometry, there is a Mébius 
transformation M that maps a, b, c to d, e, f, respectively. 


ee C,=M(C,) 


1 
a 
e=M(b) 
b 


c d=M(a) 


f=M(c) 
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M (co) cannot be oo, as M 
is one—one and 
M(3) =o. 


If M(z4) and M(oo) are 
both real, then z1, z2, z3, 
zq lie on a line. 
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Since M maps generalized circles to generalized circles, it must map C, to 
a generalized circle through d, e and f. But a generalized circle is uniquely 
determined by any three of its points, so C2 is the only generalized circle that 
passes through d, e and f. It follows that M maps C; onto C2. | 


One of the remarkable consequences of Theorem 2 is that all generalized 
circles are inversive-congruent. In particular, an ordinary circle is inversive- 
congruent to any given (extended) line. Since lines are often easier to inves- 
tigate than circles, we can sometimes simplify a problem that involves the 
inversive properties of a figure by using an inversive transformation to map 
one or more of the circles in a figure onto (extended) lines. You will see several 
examples of this in the next section. 


5.5  Coaxal Families of Circles 


We now prove some lovely theorems about circles, where the beauty of the 
final results is matched by the beauty of the proofs themselves. Inversion is 
our key tool. 


5.5.1 Apollonian Circles Theorem 


We start with the proof of the Apollonian Circles Theorem, which we stated at 
the beginning of the chapter. 


Theorem1 Apollonian Circles Theorem 

Let A and B be two distinct points in the plane, and let k be a positive real 
number other than 1. Then the locus of points P that satisfy PA: PB=k: 1 
is a circle whose centre lies on the line through A and B. 


We give two proofs of this result. The first uses methods from Euclidean 
geometry, and has the advantage of providing us with equations for the circles 
in terms of k. The second proof uses the methods of inversive geometry, and 
has the advantage of providing us with a deeper insight into the geometry of 
the circles. 


First Proof 
To keep the algebra simple, we introduce x- and y-axes into the plane such that 
A and B have coordinates (—a,0) and (a, 0), respectively, where a > 0. 

Now fix a value of k > 0, k € 1, and let C be the locus of points P that 
satisfy PA : PB = k : 1. Then a point P(x, y) belongs to C if and only if 
PA =k.- PB, and since k > 0 this is equivalent to the equation 

PA? = Kk? . PB. 
Using the Euclidean formula for distance between points, we see that this last 


equation holds if and only if 


(x +a)? +y? = k(x —a)* + y?). 
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Sadly this is not always 
the case in Mathematics! 


Recall that if k = 1, then 
P must lie on the 
perpendicular bisector of 
AB. 


The second proof is given 
later. 
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Multiplying out the brackets and collecting terms in x7, y? and x, we see that 
this is equivalent to 
e0=F) + 70=F)4 2 4) BO. 
Since k # 1, this holds if and only if 


1+K 
2 2 2. 
x“+y +2a(725) +a = 0. 


It follows that the locus C is a circle with centre c and radius r where 
1+k? 
c= |-a 3 ,0 
1—k? 
»(itReY 
r=,la 
1—k?2 


In particular, the centre lies on the x-axis, which by our choice of axes is the 
line through A and B. | 


2ak 
a= . 
[1 — k?| 


If k = 1, then the locus of points that satisfy PA = k - PB is the line @ that 
bisects AB at right angles. If we adopt the convention that this locus includes 
the point oo, then we can think of this locus as the extended line £ U {00}. With 
this convention, every positive value of k gives rise to a generalized circle 
known as a circle of Apollonius. The family of all such circles is known as the 
Apollonian family of circles defined by the points A and B. 

As k increases through the interval (0, 00), the corresponding circles range 
through the Apollonian family. 

When 0 < k < 1, we have that PA < PB, and so P is closer to A than it is 
to B. The circles that correspond to these values of k therefore lie on the same 
side of £ as does A. For values of k close to 0, PA = k - PB is small, and so 
the corresponding circles are close to A. As k tends to 1, the circles ‘grow’ and 
become ever closer to €. 

When k = 1, the corresponding ‘circle’ is the extended line @ U {oo}. 

When k > 1, we have that PA > PB, so that P is closer to B than it is to A. 
The circles that correspond to these values of k therefore lie on the same side 
of £ as does B. As k increases from 1, PB = (1/k) - PA becomes smaller, and 
the corresponding circles ‘shrink’ and become ever closer to B. 

In light of the above discussion, it is sometimes convenient to refer to the 
points A and B as point circles corresponding to the cases k = 0 and *k = oo’, 
respectively. With this convention, every point of the plane belongs to a unique 
‘circle’ associated with the Apollonian family. 


Example1 For the Apollonian family defined by the point circles (—1, 0) and 
(1, 0), determine the equation of the circle in the family that passes through the 
point (2, 1). 
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Here we are using the 
formulas for centre and 
radius given in 
Theorem 2, 

Subsection 1.1.2. 


Notice that the centre of 
the circle is not at either of 
the points A or B. 


A ‘circle of Apollonius’ is 
often referred to as an 
“‘Apollonian circle’. 


O0<k<1 l<k 


> 


a, 


(-I, 0) 
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Solution Let P be a point (x, y) in the plane whose distance from the point 
(—1, 0) is k times its distance from the point (1, 0). Then, if we use the 
Euclidean formula for distance between points in the plane, we obtain 


(x+ 1)? +y*? = k(x - 1)? + y’). 


For each value of k this yields an equation for the corresponding Apollonian 
circle. The point (2, 1) lies on the Apollonian circle for which 


(24+1)? +12? =*(2- 1)? + 1%), 


that is, k = J5. 
The equation of the Apollonian circle through the point (2, 1) is therefore 


(x+1)? + y? =5(@—1)? + y), 


which simplifies to 


x? 4+ y?-—3x4+1=0. 


Problem 1 For the Apollonian family defined by the point circles 
(O, —1) and (0, 2), determine the equation of the circle in the family 
that passes through the point (1, 1). 


Problem 2 The circles C; and C2 in an Apollonian family of cir- 
cles have the segments [—18, —2] and [3, 12], respectively, as diameter. 
Determine the point circles in the family, and hence the equation of the 
Apollonian circle in the family that passes through the point (6, 9). 


We now give a second proof of the Apollonian Circles Theorem, using 
inversive geometry. 


Theorem1 Apollonian Circles Theorem 

Let A and B be two given points in the plane, and let k be a positive real 
number other than 1. Then the locus of points P that satisfy PA: PB=k: 1 
is a circle whose centre lies on the line through A and B. 


Second Proof 


Let C be the locus of points P that satisfy PA = k-PB, and let t be the inversion 
in the circle with centre A and radius 1. We shall show that C’ = t(C) isa 
circle, and hence that C = t~!(C’) isa generalized circle. 

Let P be an arbitrary point in the extended plane, and let +(B)=B’, 
t(P) = P’. Then by the definition of inversion, 


AB-AB’=1 and AP-AP! =1, (1) 
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We specify radius 1 here 
simply for definiteness; 
any positive radius would 
serve equally well. 
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so that 
AB _ AP 
AP’ AB’ 
This shows that the sides AB and AP of AAPB are proportional to the sides AP’ 


and AB’ of AAB’ P’. Since ZBAP is the same as 7 P’AB’, it follows that AAPB 
is similar to AAB’ P’. Hence 


os AP’ 
PB AB’ 
But, from equation (1), AP’ = 1/AP, so P is related to P’ by the equation 
PB 
pl! — : (2) 
AB .- AP 
Now if P lies on the locus C, then AP (= PA) = k - PB, so that 
! pl — 1 
k- AB 


Thus P’ lies on a circle C’ of radius 1/(k - AB) and with centre B’. 
Conversely, if P’ lies on C’, then B’P’ = 1/(k - AB), so from equation (2), 
1 PB 
k-AB AB-AP’ 
Thus AP = k - PB, and so P lies on the locus C. 
It follows that the inversion t maps the locus C onto the circle C’. But 
t~! = t maps generalized circles to generalized circles, so C = t7!(C’) is 
a generalized circle. | 


This proof sets up a one-one correspondence between the family of Apol- 
lonian circles and a family of concentric circles. Each Apollonian circle In particular, the extended 
corresponds to a value of k, and this in turn corresponds to a circle of radius _ line in the Apollonian 


1/(k - AB) with centre B’ = 1(B). Henly cone pends e 
k = 1, which corresponds 


to the circle of radius 
' 1/AB = AB’, with centre 
B’. As you would expect, 


this circle passes through 
the centre A of the 
inversion f. 


The importance of this correspondence is that it enables us to characterize 
Apollonian families of circles in terms of inversive transformations. Such a 
characterization is ideal for tackling problems in inversive geometry so we 
state it in the form of a theorem. 


Theorem 2 Let A and B be distinct points in the plane, and let t be the 
inversion in the circle with centre A and radius 1. Then the Apollonian 
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family of circles defined by the point circles A and B is mapped by t¢ to 
the family of all concentric circles with centre t(B); and the family of all 
concentric circles with centre t(B) is mapped by f to the Apollonian family 
of circles defined by the point circles A and B. 


A remarkable consequence of this theorem is that in inversive geometry 
all Apollonian and concentric families of circles are congruent to each other. 
Indeed the theorem shows that every Apollonian family of circles is congruent 
to a family of concentric circles, so it is sufficient to show that all families of 
concentric circles are congruent to each other. This is easily achieved by noting 
that any family of concentric circles can be mapped onto any other family of 
concentric circles by a translation that makes their centres coincide. 


5.5.2 Families of Circles 


Any two (distinct) generalized circles in the extended plane are related in 
precisely one of the following ways: 


1. they may not intersect; 
2. they may intersect in precisely one point; 
3. they may intersect in two distinct points. 


Case | Case 2 Case 3 


In the previous subsection you met Apollonian families of circles, in which 
no two circles intersect; in the following subsection we will show that any two 
non-intersecting circles determine an Apollonian family of circles. We now 
consider the two other families, namely families of generalized circles that 
intersect at precisely one point and families of generalized circles that intersect 
at precisely two points. 


PH@® 


Apollonian circles Circles through one point Circles through two points 


Each of these three families contains precisely one extended line known as 
the radical axis of the family. The centres of the circles in each family lie on 
a line, or axis, that is perpendicular to the radical axis. Since all the circles in 
each family are symmetrical about the axis through their centres, the families 
are known as coaxal families. 
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For ¢ is self-inverse. 


Recall that a translation is 
an inversive 
transformation provided 
we adopt the convention 
that co maps to oo. 
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Definition A coaxal family of circles in the plane is a family of (general- 
ized) circles of one of the following types: 


1. an Apollonian family, with particular point circles; 
2. a family that intersect at one particular point; 
3. a family that intersect at two particular points. 


The extended line in each family is called the radical axis of the family. 


Given the Apollonian family of circles ¥ defined by point circles A and B, 
there is a corresponding coaxal family of circles Y that pass through A and B. 
Conversely, given a coaxal family of circles Y that pass through distinct points 
A and B, there is a corresponding Apollonian family of circles ¥ defined by 
the point circles A and B. 

The following theorem describes a remarkable relationship between the two 
families of circles, Y and Y. 


Theorem 3 Coaxal Circles Theorem 

Let A and B be distinct points in the plane. Let .F be the Apollonian family 

defined by the point circles A and B, and let Y be the family of all gener- 

alized circles through A and B. Then every member of ¥ is orthogonal to Two circles are 


every member of Y. orthogonal if they meet at 
right angles. 


Proof Let ¢ be the inversion in the unit circle with centre A. By Theorem 2, 
the Apollonian family is mapped by ¢ to the family of concentric circles 
with centre t(B). 


t(C) 


Now let C be an arbitrary member of Y. Since C is a generalized circle that 
passes through A and B, it follows that t(C) is a generalized circle that passes 
through t(A) = o6 and ¢(B). In other words, t(C) is an extended line through 
t(B). Clearly, this line t(C) intersects each of the circles with centre t(B) at 
right angles. Since inversion preserves the magnitude of angles, it follows that 
C meets each of the Apollonian circles in F at right angles. | 


For an Apollonian family of circles defined by point circles A and B, the 
Coaxal Circles Theorem states that every generalized circle through A and B 
meets each of the Apollonian circles at right angles. The next theorem enables 
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us to use this fact to deduce that A and B are inverse points with respect to 
each of the Apollonian circles. 


Theorem 4 ‘Two points A and B in the extended complex plane are inverse 
points with respect to a generalized circle C if and only if every generalized 
circle through A and B meets C at right angles. 


Proof We first show that A and B are inverse points if every generalized 
circle through A and B meets C at right angles. 

Let C; be any generalized circle through A and B that meets C at right 
angles, at points R and S. Now invert the figure C, in the generalized circle C. 
Since R and S remain fixed, C; is mapped to a generalized circle that meets 
C at right angles at R and S. It follows that C; is mapped to itself; for the fact 
that the image passes through R and S means that it must be a circle, and the 
radii of this circle through R and S must be along the tangents to C at R and S. 

In fact, by a similar argument, every generalized circle through A and B that 
meets C at right angles maps to itself. The only way this can happen is if A 
inverts to B, and vice versa. Hence A and B are inverse points. 

Next we show that if A and B are inverse points, then every generalized 
circle through A and B meets C at right angles. 

Let A and B be inverse points, and let C; be a generalized circle through 
A and B that meets C at R. Under inversion in C, the points A and B swap 
over and R remains fixed, so C; maps to itself. It follows that both the angles 
that C; makes with the circle C must be equal. These angles must therefore be 
right angles, which is what we want to prove. | 


Theorem 4 has two important corollaries. The first follows directly from the 
Coaxal Circles Theorem. 


Corollary 1 If C is an Apollonian circle defined by the point circles A 
and B, then A and B are inverse points with respect to C. 


Proof Let C be an Apollonian circle with respect to the point circles A and 
B. Then by the Coaxal Circles Theorem, every generalized circle through A 
and B meets C at right angles. By Theorem 4, A and B are inverse points with 
respect to C. Oo 


The second corollary asserts that inverse points are preserved by inversive 
transformations. 


Corollary 2 Let A and B be inverse points with respect to a generalized 
circle C, and let tf be an inversive transformation. Then ¢(A) and t(B) are 
inverse points with respect to the generalized circle ¢(C). 
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Proof 


C 1(C) 


1'(Cy) t(B) 


Let C, be an arbitrary generalized circle through ¢(A) and ¢(B). Then a (C1) 
is a generalized circle that passes through A and B. By Theorem 4, t~/(C}) 
meets C at right angles. But t preserves the magnitude of angles, so Cy meets 
t(C) at right angles. Since C is an arbitrary generalized circle through (A) 
and t(B), it follows from Theorem 4 that ¢(A) and t(B) are inverse points with 
respect to ¢(C). | 


An immediate consequence of Corollary 2 is that if Y is the family of all 
generalized circles that have two given points A and B as inverse points, then 
under an inversive transformation f the family .¥ maps onto the family of all 
generalized circles that have f(A) and f(B) as inverse points. 

At first sight this observation appears to be little more than a restatement 
of Corollary 2; however the restatement has a particular significance, as the 
following theorem shows. 


Theorem 5 Let ¥ be the family of all generalized circles that have A and 
B as inverse points. Then -¥ is either a concentric family of circles with 
centres A or B, or the Apollonian family of circles with point circles A 
and B. 


Proof First, suppose that either A or B is the point oo; to be definite, assume 
A = oo. Then each circle in ¥ has oo and B as inverse points. But this 
can happen only if B is the centre of each circle in ¥; in other words F is 
the family of concentric circles with centre B. A similar argument applies if 
B=. 

Next suppose that neither A nor B is the point oo, and let ¢ be the inversion 
in the circle of unit radius with centre A. By Theorem 2, t maps the family 
F to the family of all generalized circles with inverse points at f(A) = oo 
and t(B), namely the family of concentric circles with centre t(B). By The- 
orem 5, ¥ is the Apollonian family of circles defined by the point circles 
A and B. a 
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Roughly speaking, you 
can think of the family of 
concentric circles with 
centre A as an Apollonian 
family of circles defined 
by point circles A and oo. 
Then an Apollonian 
family of circles defined 
by the point circles A and 
B maps to an Apollonian 
family of circles defined 
by the point circles ¢(A) 
and t(B). 
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In light of the remarks that precede this theorem, it follows that under an 
inversive transformation ¢ an Apollonian family of circles defined by point 
circles A and B 


EITHER maps to an Apollonian family of circles defined by the point circles 
t(A) and t(B), 
OR maps to a concentric family of circles with centre t(A) or ¢(B). 


Similarly a concentric family of circles with centre A either maps to the Apol- 
lonian family of circles defined by the point circles t(A) and t(0o), or it maps 
to a concentric family of circles with centre ¢(A) or t(00). 
So far, we have used inversive geometry to prove results about Apollonian 
families of circles, but similar methods can be used to prove results about 
other plane figures. The technique is to use an inversive transformation (often 
an inversion) to map one or more circles in the figure to extended straight lines. 
The additional symmetry that results from this transformation is then used to 
establish the required result. 

For future reference, we also note here a fact that we met in the proof of 
Theorem 4 above. 


Corollary 3 Let C and C, be circles that meet at right angles at two points 
R and S. Then the centre of C; is the point T of intersection of the tangents 
to C at R and S; and C; maps to itself under inversion in C. 


Example 2 Let C;, Cz and C3 be circles in the plane such that C2 and C3 
touch at a point P, C; and C3 touch at a point Q, and C; and C»> touch at a 
point R. Let C be the circle that passes through P, Q and R. Prove that C cuts 
C1, C2 and C3 at right angles. 


Solution Let t be an inversion in a circle with centre R. Then t(C,) and 
t(C2) are straight lines. Moreover, since C; and C2 do not meet at any point 
other than P, it follows that t(C,) and t(C2) cannot meet in C and must 
therefore be parallel. 

Now C3 is tangential to Cy and C2, and does not pass through R, so t(C3) 
must be a circle that touches ¢(C;) and t(C2) tangentially at t(Q) and t(P), 
respectively. Next observe that C passes through R, so t(C) is the extended line 
through t(Q) and t(P). Since the transformed figure has reflectional symmetry 
about ¢(C), it follows that t(C) cuts t(C1), t(C2) and ¢(C3) at right angles at 
t(P) and t(Q). But ¢ preserves angles, so C cuts C;, C2 and C3 at right angles 
at P and Q. 

A similar argument with ¢ replaced by inversion in a circle with centre P (or 
Q) shows that C cuts C; and C2 at right angles at R. 


t(C)) 


Cc 
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5.5.3 Two Circles Determine a Coaxal Family 


It is clear that if we are given two circles in a coaxal family of mutually 
tangential circles, this determines the whole family. 


GO BH@Q 


Similarly if we are given two circles in a coaxal family of circles through 
two fixed points, this determines the whole family. This raises the question as 
to whether two circles in an Apollonian family are sufficient to determine the 
whole family. We now prove that this is indeed the case. The crucial tool that 
we need is the following result. 


Theorem 6 The Concentricity Theorem 

Let C; and C2 be any two non-intersecting circles in the plane. Then there 
is a inversive transformation that maps C; and C2 onto a pair of concentric 
circles. 


Proof If the circles are concentric then there is nothing to prove, so we may 
assume they are not concentric. 


Step 1 Pick a point O on the circle Cj, and invert both circles in the circle of 


unit radius with centre O. Under this inversion C, maps to a line, C i and C2 
maps to a circle, C4. 


Step 1 Cc,” 
ee 
° {) 


Step 2 Pick a point T on C4 that does not lie on the perpendicular from the 
centre of C), to the line C}, and let the tangent to C at T intersect C} at the 
point U, say. Draw the circle C4 with centre U and radius UT. This circle is 
perpendicular to C) at T, because a tangent to a circle is perpendicular to the 
radius at the point of contact. The circle C3 is perpendicular to the line C{ 
because its centre lies on the line C}. 


5: Inversive Geometry 


Coaxal Families of Circles 


C,’ 


Step 2 Step 3 


Step 3 Repeat the construct in Step 2 starting with another position for the 
point 7, to obtain a circle C), perpendicular to both C; and C’, at the respective 
points 7. Let C; and C/, meet at Q and R, say. 

Step 4 Invert the figure again, this time in the unit circle with centre Q. Then 
the line C; maps to a circle C/’, and the circle C5 maps to a circle C4; the 
circles C4 and C, pass through Q, so they invert to straight lines at right angles 
to Ci} and Cy. These lines are therefore diameters of the circles C// and CY, 
and so the point where they meet (the image R” of R) must be the centre of 
both circles. 


Step 4 
—_—_——> 


Step 5 Composing the inversion in Step 1 with the inversion in Step 4, we 
obtain an inversive transformation that maps C; and C2 onto the pair of con- 
centric circles C// and CY. a 


We can now prove the following beautiful result. 


Theorem 7 Two Apollonian Circles Theorem 
Let C; and C2 be two non-concentric circles that do not intersect. Then 
there is a unique Apollonian family of circles that contains C; and C2. 


Proof By the Concentricity Theorem there is an inversive transformation ¢ 
that maps C; and C2 onto a pair of concentric circles t(C,) and t(C2). Let O 
be the common centre of these circles, and let Y be the family of concentric 
circles with centre O. Under the inversive transformation t~! the family Y 
maps to the family of all generalized circles that have t/(O) and t~! (00) as 
inverse points; we denote this family by ¥. Now -¥ cannot be a concentric 
family of circles, since it contains Cy and C2; so, by Theorem 5, it must be an 
Apollonian family of circles. 
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As in the (second) proof 
of Theorem 1, the choice 
of | for the radius is not 

significant. 


Theorem 6 above 
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To complete the proof we must show that ¥ is the only Apollonian family of 
circles that contains C; and C2. To do this suppose that ¥’ is any Apollonian 
family of circles that contains C; and C2. Then #" is mapped by t¢ either to 
an Apollonian family of circles, or to a concentric family of circles. In fact 
the image of .¥’ under f contains the concentric circles t(C,) and t(C2) and 
must therefore be the family ¥ of concentric circles with centre O. It follows 
that Y and ¥’ are the same family of circles, for they are both mapped to 
G by t. H 


5.5.4 Some Applications of Inversion 


As we mentioned earlier, inversion can often be used to prove results about 
plane figures, using a suitable inversion to map one or more circles in the 
figures to extended lines — the additional symmetry then being used to establish 
the desired result. 

Our first application of inversion to prove a beautiful result is known as 
Steiner’s Porism. A porism is a mathematical construction problem that has 
a surprising answer: either the construction cannot be carried out or it has 
infinitely many solutions. 


Theorem 8 __ Steiner’s Porism 
Let C; and C2 be non-intersecting circles, with C; inside C2. Then: 


EITHER it is impossible to fit a chain of circles between C; and C2, with 
each circle touching C; and C2 and two other circles in the chain; 

OR it is possible to construct such a chain, and the first circle in the 
chain can be placed in any convenient position. 


Proof There are two possibilities. 


EITHER It is impossible to fit a chain between C; and C2 with the properties 
described, in which case the porism is established. 

OR There is at least one chain F that fits between C; and C2. To estab- 
lish the porism in this case we must show that a chain also exists for 
each choice of a first circle C between C, and C2. 


By the Concentricity Theorem there is an inversive transformation f that maps 
C, and C2 onto concentric circles C 1 and C5. Under ¢ the chain F maps onto 
a chain of circles F’ between C; and C4, and C maps onto a circle C’ which 
touches C; and C}. Since C} and C) have a common centre O, we can rotate 
the chain F’ about O until its first circle is superimposed on C’. If we denote 
the rotated chain by G’, then the inversive transformation t~! maps G’ back 
to a chain G between C; and C2. Moreover, the first circle of G is C as 
required. 
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Jakob Steiner 
(1796-1863) was a 
19th-century Swiss 
geometer. 


chain of circles 


Recall that ¢~! must be an 
inversive transformation, 
since f is. 


Coaxal Families of Circles 


chain of circles F’ |_| 


chain of circles F 


Another famous porism is Poncelet’s Porism: If C; and C2 are any two plane 
conics for which (for any givenn > 2) itis possible to find one n-sided polygon 
which is simultaneously inscribed in C; and circumscribed around C2, then it 
is possible to find infinitely many of them. 

Next, we prove a classical result known as Ptolemy’s Theorem — but using 
inversion rather than using Ptolemy’s Pythagorean geometry methods! A key 
tool in our proof is the following useful result. 


Lemma 1 Let a and b be points in C — {O}. Then, under inversion in a 


circle C with centre O and radius r the distance between the images of a 


r2 


and bis coe |a — bl. 


Proof Inversion in C may be represented in the complex plane by the 
transformation 


NI| 


t(z) = —, wherezeC—{O}. 


It follows from this formula that 


ye 72 
t(a) —t(b) = aE 
= aia —4@), 
a-b 
from which we may deduce that 
r2 
|t(a) —t()| = lap? —al, 
which is equivalent to the desired result. | 


The formula for distances in Lemma | plays a crucial role in our proof of 
Ptolemy’s Theorem that the sum of the product of the lengths of opposite sides 
of a cyclic quadrilateral equals the product of the lengths of the diagonals. 
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We omit a proof, as it is 
beyond the scope of this 
book. 


Claudius Ptolemaus 

(c. 85-165 AD) was a 
geometer, astronomer and 
geographer in Alexandria, 


Egypt. 


'. t(a) 


|t(a)-t(b) | 


t(b) 


This is a special case of a 
general formula for 
inversion in C that you 
met in Theorem 3, 
Subsection 5.2.1. 
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Theorem9 Ptolemy’s Theorem 
Let A, B, C, D be the vertices (in order round a circle) of a quadrilateral 
ABCD inscribed in a circle. Then 


7.V Dew >] Oi oy -V oem O) DEW .\ Our 3) DP 


Proof Let the transformation ¢ be inversion of the figure ABCD in the circle 
@ with centre A and radius 1. This inversion maps A to oo, the circle @ to an 
extended line (which we will denote by €@), and the points B, C, D on @ to 
points B’, C’, D’ on £. 

Since B, C, D are in order on @, it follows that B’, C’, D’ are in order on 
£. Consequently, 


B'C'+C'D! = B'D". (3) 


Since ¢ is self-inverse, the images of B’, C’, D’ under inversion in @ are 
B,C, D. Hence, if we apply the result of Lemma | in turn to the distances 
B'C’, C'D’, B'D’ under the inversion f, it follows from equation (3) that 


BC af CD __ ~BD 
AB-AC  AC-AD- AB- AD’ 
Multiplying both sides of this equation by the product AB - AC - AD, we obtain 
the desired result 


AD-BC+AB-CD=AC- BD. a 
Our final application of inversion is particularly attractive visually. 


Theorem 10 Shoemaker’s Knife 

Let D be a region in the upper half-plane, whose boundary consists of 
three semicircles S;, S2 and S3 that meet at three points A, B and C on the 
x-axis, as shown. Then it is possible to fit a chain of circles into D between 
S; and $2, with the first circle touching S;, $2 and $3, and with each suc- 
cessive circle touching $;, Sz and the previous and following circles in the 
chain. 


Proof Lett denote inversion in the circle @ with centre A and radius 1. 

Under f, the positive x-axis maps to itself, and points in the upper half-plane 
(the set of points above the x-axis) map to points in the upper half-plane. 

The circle of which $3 is a portion is mapped by ¢ to another circle, which 
crosses the x-axis at right angles, at points B’ and C’ say. The image of $3 is 
therefore another semicircle in the upper half-plane, with endpoints B’ and C’. 

Under f, the origin A maps to oo, so the semicircles $; and Sz map to 
portions of (generalized) circles in the upper half-plane that join co to B’ and 
C’, respectively. These portions do not intersect in the (ordinary) plane C, and 
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The name ‘Shoemaker’s 
Knife’ (or apBexoo, 
Arbelos) is due to 
Archimedes. 


t(S)) t(S5) 


1(S3) 


a 
B=t(B) C=C) x 


Exercises 


they meet the x-axis at right angles. Hence the images of S; and S2 are two 
vertical half-lines with endpoints B’ and C’. 

Now, it is obviously possible to fit a chain of circles into the vertical strip 
bounded by the semicircle ¢(S3) and the vertical lines ¢(S;) and t(.S2), with the 
first circle touching ¢(.S3) and with each successive circle touching ¢(.S;), f ($2) 
and the previous and following circles in the chain. 

We can transform this new configuration back into the original configuration 
by applying the inverse transformation t~!. Since t~! preserves circles and 
their tangencies, the required result now follows. | 


Problem 3 Prove that the height above the x-axis of the centre of the 

n-th circle, C;, say, in the chain of circles constructed in Theorem 10 is 

n times the diameter of that circle. 

Hint: Invert the figure in a circle with centre A that intersects C,, at 
right angles. 


Problem 4 In the figure for Theorem 10, let A, B and C have coor- 
dinates (0,0), (b,0) and (c, 0), respectively. Show that the centres of the 
circles in the chain constructed in Theorem 10 all lie on an ellipse with 
foci F (5b, 0) and F'(5c, 0), and major axis [ (0, 0), (5b +c), 0)]. 
Hint: Use the Sum of Focal Distances of the Ellipse theorem. 


5.6 Exercises 


Section 5.1 


1. Determine the image under inversion in the unit circle @ of each of the 
following points. 

@) B.-4) ®CLD) ©@0 @(3-%8) 

2. Determine the image under inversion in the unit circle @ of the following 
circles C (with the origin removed if it belongs to C): 

(a) the circle with centre (3, —4) and radius 5; 
(b) the circle with centre (1, 2) and radius 3. 

3. Determine the image under inversion in the unit circle @ of the following 
lines € (with the origin removed if it belongs to @): 
(a) the line y+ 3x =5; 

(b) the line y+ 2x = 0. 

4. Three circles C, C2, C3 pass through the origin O, and meet at three other 
distinct points D, E and F, as shown. The following sets of points are 
collinear: A, O and D; B, O and E; C, O and F. OA and OB are diameters 
of Cj and C2, respectively. Prove that OC is a diameter of C3. 

Hint: Invert the figure in a unit circle with centre O, and then use the fact 
that the altitudes of a triangle are concurrent. 
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Theorem 5, Subsection 
1.1.4 
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5. By means of a specific example, show that the centre of the image circle 
t(C2) of a circle C2 under inversion ¢ in another circle C; may not be the 
image of the centre of C2 under f. 

6. Let the origin O, P and Q be distinct points; and let P’ and Q’, respectively, 
be the images of P and Q under inversion in the circle with centre O and 
radius r. Prove that 


r2 


= op-00'? 


P’ Q’ 
(For simplicity, you may assume that O, Q, P are points inside the circle 
of inversion and occur in this order along the radius outwards from O. The 
result holds in all other cases too, by a similar argument.) 

7. Let P and Q be distinct points in C — {O} with the origin O, P and 
Q not being collinear; and let P’ and Q’, respectively, be their images 
under inversion in the circle with centre O and radius r. Let ON and ON’ 
be the perpendiculars from O to PQ and P’Q’ (extended, if necessary), 
respectively. 

(a) Prove that 


t / - 
Pe op.00'* 
(b) Prove that 
ON _ ON 
PQ P’'O 


8. Let AB be a diameter of a circle C, and let chords AD and BE (extended, 
if necessary) of C intersect at a point F. Prove that the circle DEF is 
orthogonal to the circle C. 


Section 5.2 
1. Let ¢ be the transformation defined by 


t(z) = 50+ V3i)z+2i (ze C). 


(a) Show that f represents an isometry. 
(b) Interpret ¢ as the composite of a rotation and a translation. 
(c) Interpret ¢ as a composite of reflections. 
2. Determine the image under inversion in the unit circle @ of each of the 
following points. 
(a) -3+4i (b) 5 — 123 
3. Let C be the circle of radius 2 with centre | + i. 
(a) Write down the inversion in C of the extended complex plane, as a 
transformation of C. 
(b) Determine the image of i under the inversion in C. 
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4. Let t be the extended linear function defined by 


—5iz+(2+6i), ifzeC, 


oO, ifz= oo. 


t(z) = 


Express f as a composite of inversions of the extended complex plane. 


5. Prove that two points z and z* in the complex plane correspond to 
two diametrically-opposite points on the Riemann sphere if and only if 
zZ-(z*) =—-1. 

Section 5.3 


1. 


Which of the following formulas define a Mobius transformation? For each 
formula that does define a Mobius transformation, state the image of oo 
under M. 


(a) M@) =-224) (6) M@) =-224+ 4 ©M@ = 2E 


. Decide which, if any, of the following matrices 


ai = A si 3 
ee) to ee Ce 


are associated with each of the following Mobius transformations. 


@MO=F O&M) = © ma) ==) 


. Determine a formula for each of the following Mobius transformations, 


where M, and M2 are the Mobius transformations defined in Exercise 2. 
(a) Mio Mi(z) (b) M10 Mx(z)_ (c) M2 0 My '(2z) 


. State which of the following transformations of € onto itself are inversive 


transformations, giving a brief reason in each case. 
@O=5 O@=5 Ore) = 24 


. Let ¢ be the inversive transformation defined by 


eg" 


ie | 


Determine the image of each of the following generalized circles under f: 
(a) the extended line £ U {oo}, where £ is the line with equation y = x; 
(b) the unit circle @. 


Section 5.4 


i 


For each set of three points below, determine the Mébius transformation 
that maps the three points to 0, 1, oo, respectively. 
(a) i, -i,00 §=6((b) 00, 1, (c) 27, 00,3 (d) 1, 2,3 


. Using the results of Exercise 1, determine the Mébius transformations that 


map: 
(a) 1,2,3 to 2i, c, 3; (b) co, 1,i to i, —i, 0; 
(c) 2i,c0,3 to o, 1,1; (d)i,-—i,co to 1,2,3. 


. Determine whether each of the following sets of points are collinear: 


(a) 3+i,2i,-6+6i (b)1+42i,4—5i, 10 — 20 
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4. Determine whether each of the following sets of points lies on a circle: 
(a) i, -3,14+21,2+4i (b)1—2i,-4+i7,4+i1,1+4+6i 


Section 5.5 


1. An Apollonian family of circles ¥ includes (6,0) as a point circle and also 
the circle C with equation x* + y* = 4. Determine the other point circle in 
the family. 

2. The circles Cy and C2 in an Apollonian family of circles have the segments 
[0,8] and i 5, respectively, of the x-axis as diameters. 

(a) Determine the point circles of the family. 
(b) Hence determine the equation of the Apollonian circle in the family 
that passes through the point (1,1). 

3. Determine the images under inversion in a unit circle with centre A of the 
following families of coaxal circles. 

(a) F is the family of all generalized circles tangential to the y-axis at the 
origin, and A = (0,0). 

(b) # is the family of all generalized circles through the points 
A = (—1,0) and B = (1,0). 

(c) F is the Apollonian family of all generalized circles with point circles 
A = (—1,0) and B = (1,0). 

4. Let C be the (extended) y-axis in the plane, and C2 the circle with centre 
(2, 0) and radius 1. Find a sequence of inversions whose composite inversive 
transformation maps C, and C2 onto concentric circles. 

5. Let C; and C2 be circles that touch at a point P, apart from which C2 
lies inside C,. Prove that it is possible to fit a chain of circles between 
C, and C2, with each circle touching C; and C2 and two other circles in 
the chain, and with the first circle in the chain placed in any convenient 
position. 

6. Prove that, if the Mébius transformation M maps @, the unit circle with 
centre the origin, to itself, then M is necessarily of the form 


M(z) =K2— ~, where|K|=1 and |o| #1. 
az—1l 


7. Let ¥ be an Apollonian family of circles with point circles A and B and 
radical axis €. Prove that, for any given point P on the radical axis, all the 
tangents from P to circles in are of equal length. 

8. Let D be the region in the upper half-plane whose boundary consists of 
three semicircles $,, S2 and $3 with diameter the segments [—1, 0], [0, 1] 
and [—1, 1], respectively, of the x-axis, as shown. 

(a) Prove that it is possible to fit a chain of circles into D between S$; and 
S2, with the first circle touching S$), S2 and $3, and with each successive 
circle touching S;, Sz and the previous and following circles in the 
chain. 
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2n 
4n2—] 


(b) Prove that the n"™ circle in the chain has its centre at a height 


above the x-axis, and has radius ao: 

9. Let C;, Co, C3 be three circles, external to each other, where the pairs C, 
and C2, Cz and C3, and C3 and C each touch at a point. Prove that there 
are exactly two circles that touch all of C,, C2, C3. 

These circles are sometimes called kissing circles or Soddy circles, and 
were discovered by René Descartes (1643), then rediscovered by Jakob 
Steiner (1826) and Frederick Soddy (1936). Soddy published his discov- 
ery as a poem titled The Kiss Precise, in the scientific journal Nature; it 


starts: 


For pairs of lips to kiss maybe 


Involves no trigonometry. Frederick Soddy 

‘Tis not so when four circles kiss (1877-1956) won the 
Each one the other three. 1921 Nobel Prize in 

To bring this off the four must be chemistry for research 

As three in one or one in three. into radioactive decay and 
If one in three, beyond a doubt the theory of isotopes. 


Each gets three kisses from without. 
If three in one, then is that one 
Thrice kissed internally. 


Summary of Chapter 5 


Section 5.1: Inversion 


1. Under reflection in a line @ in the plane, a point A is mapped to an 
image point A’ that lies an equal distance from @, but on the opposite side 
of £. 

2. Let C be a circle in the plane, with centre O and radius r, and let A be any 
point other than O. If A’ is the point on the line OA that lies on the same 
side of O as A and satisfies the equation OA- OA’ = r, then we call A’ the 
inverse of A with respect to (or ‘in’) the circle C. The point O is called 
the centre of inversion, and C the circle of inversion. The transformation 
t defined by t(A) = A’ (A € R? — {O}) is known as inversion in C. 

There is no point to which O is mapped by the inversion, and no point 
that is mapped to O by the inversion. 

Inversion in a circle is a generalization of reflection in a line. 

We often use the term inversion to mean EITHER reflection in a line OR 
inversion in a circle. 

3. If A is a point outside a circle C with centre O, AB and AC are the two 
tangents from A to C, and A’ is the point of intersection of OA and BC, 
then A and A’ are inverse points with respect to C. 

4. Inversion in a circle C maps points outside C to points inside C, and vice- 
versa; points on C map to themselves. 
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10. 


Inversion in a circle is a one-one transformation of the plane minus the 
centre of inversion onto itself. 

Inversion is a self-inverse transformation. 
Inversion is not a Euclidean transformation; for example, it does not 
preserve lengths. 

Inversion is not an affine transformation; for example, it does not 
preserve straight lines. 
Inversion in the unit circle @ is the function f : (x, y) (ai. wit), 


where (x, y) € R* — {O}. 
Strategy To determine an equation for the image of a curve under 
inversion in the unit circle @: 
1. write down an equation that relates the x- and y-coordinates of the 
points on the curve; 
2. replace x by x/(x? + y?) and y by y/(x? + y?), and simplify the 
resulting equation. 
If the curve passes through the origin we must first remove the origin from 
the curve. If we remove a point A from a curve, the curve is said to be 
punctured at A. 
Images of lines under inversion Under inversion in a circle with cen- 
tre O: 
(a) a line that does not pass through O maps onto a circle punctured at O; 
(b) a line punctured at O maps onto itself. 
Images of circles under inversion Under inversion in a circle with 
centre O: 
(a) acircle that does not pass through O maps onto a circle; 
(b) acircle punctured at O maps onto a line that does not pass through O. 
Even if inversion maps one circle onto another, it may not map centre 
to centre. 
Let c; and c2 be two curves that intersect at the point A, and let the tan- 
gents to the curves at A be ¢; and £2, respectively. Then the anti-clockwise 
angle from cj to c2 at A is the anti-clockwise angle from £; to £2, and the 
clockwise angle from c; to cz at A is the clockwise angle from ¢, to 09. 
Symmetry Lemma Let ¢ be a line that does not pass through the point 
O. Then under inversion in a circle with centre O, £ maps to a circle C 
(punctured at O), and the tangent to C at O is parallel to £. 
Angle Theorem An inversion in any circle preserves the magnitude of 
angles between curves but reverses their orientation. 


Section 5.2: Extending the Plane 


1. 


The transformation f(z) = z+c (z € C), where c = a+ib, is a translation 
through the vector (a, b). 

The transformation f(z) = z (z € C) is a reflection in the x-axis. 

The transformation t(z) = az (z € C), where |a| = 1, a = cos) + 
i sin 69, is a rotation through an anti-clockwise angle 69 about the origin. 
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Each isometry ¢ of the plane can be represented in the complex plane 
by one of the functions t(z) = az+ D or t(z) = az + b, where a, b € C, 
|a| = 1. Conversely, all such functions represent isometries. 

Every isometry can be expressed as a composite of reflections. 

2. The transformation t(z) = kz (z € C), where k is real and positive, is a 
scaling by a factor k. 

3. An inversion in a circle C of radius r with centre (a, b) may be represented 
in the complex plane by the transformation f(z) = _ +c(z eC — {c}), 
where c = a + ib. : 

In particular, inversion in the unit circle @ may be represented by the 
transformation t(z) = L (ze C— {O}). 

4. A linear function is a function of the form t(z) =az+b(z «€ ©), where 
a,beCanda £0. 

It may be decomposed into a composite fz o tj, where f; is the scaling 
t1(z) = |a|z and f2 is the isometry f2(z) = (a/|a|)z + b. This can be 
described geometrically as a scaling by the factor |a|, followed by a rota- 
tion through the angle Arg(a/|a|), followed by a translation through the 
vector (Reb, Imb). 

Linear functions preserve angles, and map circles and lines to circles 
and lines. 

5. The reciprocal function is the function of the form f(z)= 1 
(zeEC —{O}). 

It may be decomposed into a composite fz 0 t;, where ft, is the inversion 
t(z) = 1/Z and fp is the conjugation f2(z) = Z. 

The reciprocal function preserves angles. 

6. The extended (complex) plane C is the union of the complex plane C and 
one extra point, the point at infinity, denoted by the symbol oo. 

7. An extended line is any line @ in the plane together with the point oo. An 
extended line can be thought of as a circle of infinite radius. 

A generalized circle in the extended plane is a set that is either a circle 
or an extended line. 

8. Let C be a generalized circle in the extended complex plane. Then an 
inversion of the extended plane with respect to C is a function ¢ defined 
by one of the following rules. 

(a) If C is acircle of radius r with centre O, then 


the inverse of A with respect to C, 


if Ae C — {0}, 
t(A) = 
00, if A =O, 
O, if A = 00. 


(b) If C is an extended line @ U {oo}, then 


the reflection of Ain’, if A €C, 


t(A) = 
(A) OX, ifA=o. 
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10. 


11. 


12. 


13. 


Inversions of the extended plane map generalized circles onto gener- 
alized circles. 


. The extended conjugation function, extended reciprocal function and 


extended linear functions may be defined in the natural way as mappings 
of C to itself. 

The extended reciprocal function and the extended linear functions can be 
decomposed into a composite of inversions, and map generalized circles 
onto generalized circles. 

Strategy To scale ¢ by a factor k: 

1. invert in the circle {z : |z| = 1}, then 

2. invert in the circle {z : |z| = Vk}. 

Strategy To rotate ¢ through an angle 6: 

1. invert in the line {z : Arg z = 0} U {oo}, then 

2. invert in the line {z :Arg z= 361 U {oo}. 


Strategy To translate ¢ through a vector (a, b): 

1. invert in the line {(x, y) : ax + by = 0} U {oo}, then 

2. invert in the line {(x, y) : ax + by = (a? + b*)} U {oo}. 

The Riemann sphere S is the sphere in R* with centre the origin and 
radius 1. The points N(0,0,1) and S = (0,0,—1) are called the North 
Pole and the South Pole of S, respectively. 

Lines through the North Pole intersecting S at points P’and the (x, y)- 
plane at points P give a one-one correspondence between points on S and 
points in the plane. (We associate the North Pole with the point oo in the 
extended complex plane.) 

The one-one onto function x : S > € which maps points on S to the 
associated points in the extended complex plane is called stereographic 
projection. If (X,Y,Z) eS andz=x+ive C, then 


0 de ae 
w(X,Y,Z) = —— +i—_ 
1-Z 1-Z 
and 
2x 2y Fay = 1 
= ‘ 
mw (. = ’ ' ; 
a) (ayo eye eye I 


Under stereographic projection, circles on the Riemann sphere map onto 
generalized circles in C. In particular, circles on the sphere that pass 
through N map onto extended lines in C, and circles on the sphere that 
do not pass through N map onto ordinary circles in ¢. 

Stereographic projection preserves the magnitude of angles. 


Section 5.3: Inversive Geometry 


1. 


A transformation t : C — C is an inversive transformation if it can be 
expressed as a composite of inversions. 

Inversive geometry is the study of those properties of figures in C that 
are preserved by inversive transformations. 
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10. 


The extended reciprocal function and the extended linear functions are 
inversive transformations. 
Inversive transformations preserve the magnitude of angles, map general- 
ized circles to generalized circles, and preserve tangency. 

The set of inversive transformations forms a group under the operation 
of composition of functions, called the inversive group. 
Every Euclidean transformation is also an inversive transformation, and 
every Euclidean property is also an inversive property. 

The ‘doubling map’ t(z) = 2z (z € C) is an affine transformation and 
an inversive transformation, but is not a Euclidean transformation. 


The transformation of R? to itself given by t(x) = C i) X rep- 


resents a horizontal shear; it is an affine transformation, but is not an 
inversive transformation. 
A Mobius transformation is a function t : C > C of the form M (Zz) = 
acth where a, b,c,d € C, and ad — be #0. 

If c = 0, we adopt the convention that M(oo) = oo; otherwise we adopt 
the convention that M(—d/c) = oo and M(oo) = a/c. 

The extended linear functions and the extended reciprocal function are 
Mobius transformations. 

Every Mobius transformation is an inversive transformation. 
Mobius transformations preserve the magnitude and orientation of angles, 
and map generalized circles onto generalized circles. 


Let M be a Mobius transformation defined by M(z) = — where a, b, 


b 
c,d € C, and ad — bc # 0; then A = (< ‘) is a matrix associated 
Cc 


with M. 

A matrix associated with a M6bius transformation is invertible. 

If A is a matrix associated with a MGbius transformation M, then so is 
cA for any non-zero c € C. Every matrix associated with M has the form 
cA for some c € C — {O}. 

Let M; and M2 be Mobius transformations with associated matrices A; 

and Aj, respectively. Then M; o M2 is a Mobius transformation with 

associated matrix A; Ao. 

Strategy To compose two Mobius transformations M, and Mo: 

1. write down matrices A; and Ag associated with M, and Mo; 

2. calculate A; Ao; 

3. write down the Mobius transformation M, o M2 with which A;Az is 
associated. 


The inverse of the Mébius transformation M(z) = <+2 


= 18 also a Mobius 
transformation, and it may be written in the form M l(z) = a re 

The set of all Mobius transformations forms a group under composition of 
functions. 

Every inversion ¢ has the form f(z) =M(z), where M is a Mobius 


transformation. 
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Every inversive transformation ¢ can be represented in ¢ by one of 
the formulas f(z) = aan or t(z) = aoe where a, b, c, d € C, and 
ad—bc £0. 

The composite of an even number of inversions is a Mobius transfor- 
mation and is called a direct inversive transformation (since it preserves 
the orientation of angles); the composite of an odd number of inver- 
sions is called an indirect inversive transformation (since it reverses the 
orientation of angles). 


11. Strategy To determine the image of a generalized circle C under an 
inversive transformation f: 
1. write down three points z1, z2, z3 on C; 
2. determine the images f(z1), ¢(z2), t(z3); 
3. the image ¢(C) is the (unique) generalized circle through t(z1), t(z2), 
t(z3). 


Section 5.4: Fundamental Theorem of Inversive Geometry 


1. Fundamental Theorem of Inversive Geometry 
Let z1, Z2, z3 and w 1, w2, w3 be two sets of three points in the extended 
complex plane C. Then there is a unique M6bius transformation M which 
maps Zz; to wy, Z2 to w2, and z3 to w3. 

2. Strategy To determine the Mobius transformation M which maps three 
given points z1, Z2, z3 onto the points 0, | and oo, respectively: 
1. choose the appropriate form of mapping from the following formulas 


for M: 
mapping form of M 
Zi, 3, a 0 Ico: | KF 
K 
00, 22,23 > 0, 1,00 | > 
£721 


Z1, ©, 73+ 0, 1, 00 


Z1, 22, Of 0,1,00 | K(z—2z1) 
2. find the complex number K for which M(z2) = 1. 
3. Strategy To determine the Mobius transformation M which maps the points 
Z1, Z2, 23 to the points w1, w2, w3, respectively: 
1. find the Mobius transformation M; which maps the points z1, z2, z3 to 


the points 0, 1, 0, respectively; 

2. find the Mobius transformation M2 which maps the points w1, w2, w3 
to the points 0, 1, oo, respectively; 

3. calculate M = M;' oM,. 

4. Strategy To determine whether z1, z2, z3 and za lie on a circle: 

1. find the Mobius transformation M which maps Zz}, Z2, z3 to 0, 1, ~, 
respectively; 

2. the points z;, Z2, 23, z4 lie on a generalized circle if and only if M(z,4) is 
real; 


Summary of Chapter 5 


3. the generalized circle in Step 2 is a circle provided that M(oo) is not 
real. 
If M(z4) and M(oc) are both real, then z1, z2, z3, z4 lie on a line. 
5. Let C; and C2 be generalized circles in the extended complex plane. Then 
there is a Mobius transformation that maps C; onto C2. 
All generalized circles are inversive-congruent. 


Section 5.5: Coaxal Families of Circles 


1. Apollonian Circles Theorem Let A and B be two distinct points in the 
plane, and let k be a positive real number other than 1. Then the locus of 
points P that satisfy PA: PB = k: 1 is a circle whose centre lies on the line 
through A and B. 

The centre of the circle is not at either A or B. If A=(—a,0) and B = 
(a, 0), then the circle has centre c and radius r where c = (-a (1445) 0 


2ak 
[1k |° 
If k = 1, then P lies on the perpendicular bisector of AB. 


Every positive value of k gives rise to a generalized circle, known as a cir- 
cle of Apollonius; the family of all such circles is known as the Apollonian 
family of circles defined by the points A and B. 

If 0 <k <1, the Apollonian circle surrounds A; if k > 1, the Apollonian 
circle surrounds B. The points A and B are called point circles, corre- 
sponding to the cases k = 0 and ‘k = oo’, respectively. Every point in the 


andr = 


plane lies on precisely one circle (or generalized circle) in each Apollonian 
family. 

2. Let A and B be distinct points in the plane, and let ¢ be the inversion in 
the circle with centre A and radius 1. Then the Apollonian family of circles 
defined by the point circles A and B is mapped by ¢ to the family of all 
concentric circles with centre t(B); and the family of all concentric circles 
with centre ¢(B) is mapped by ¢ to the Apollonian family of circles defined 
by the point circles A and B. 

3. A coaxal family of circles in the plane is a family of (generalized) circles 
of one of the following types: 

1. an Apollonian family, with particular point circles; 
2. a family that intersect at one particular point; 
3. a family that intersect at two particular points. 
The extended line in each family is called the radical axis of the family. 
Coaxal Circles Theorem Let A and B be distinct points in the plane. 
Let ¥ be the Apollonian family defined by the point circles A and B, and 
let Y be the family of all generalized circles through A and B. Then every 
member of .¥ is orthogonal to every member of Y. 

4. Two points A and B in the extended complex plane are inverse points with 
respect to a generalized circle C if and only if every generalized circle 
through A and B meets C at right angles. 
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If C is an Apollonian circle defined by the point circles A and B, then A 
and B are inverse points with respect to C. 

Let A and B be inverse points with respect to a generalized circle C, and 
let t be an inversive transformation. Then ¢(A) and ¢(B) are inverse points 
with respect to the generalized circle t (C). 

Let ¥ be the family of all generalized circles that have A and B as 
inverse points. Then .¥ is either a concentric family of circles with centre 
A or B, or an Apollonian family of circles with point circles A and B. 

Let C and C, be circles that meet at right angles at two points R and S. 
Then the centre of C; is the point T of intersection of the tangents to C at 
R and S; and C; maps to itself under inversion in C. 

5. Concentricity Theorem Let C; and C2 be any two non-intersecting circles 
in the plane. Then there is an inversive transformation that maps C; and C2 
onto a pair of concentric circles. 

6. Two Apollonian Circles Theorem Let C; and C2 be two non-concentric 
circles that do not intersect. Then there is a unique Apollonian family of 
circles that contains C; and C9. 

7. A porism is a mathematical construction problem that has a surprising 
answer: either the construction cannot be carried out or it has infinitely 
many solutions. 

Steiner’s Porism Let C; and C» be non-intersecting circles, with Cy 
inside Cz. Then EITHER it is impossible to fit a chain of circles between 
C, and C2, with each circle touching C; and C2 and two other circles in the 
chain OR it is possible to construct such a chain, and the first circle in the 
chain can be placed in any convenient position. 

Poncelet’s Porism If C; and C2 are any two plane conics for which (for 
any given > 2) itis possible to find one n—sided polygon which is simul- 
taneously inscribed in C; and circumscribed around C2, then it is possible 
to find infinitely many of them. 

7. Let a and b be points in C — {O}. Then, under inversion in a circle C 
with centre O and radius r the distance between the images of a and b is 
dala - Bl. 

Ptolemy’s Theorem Let A,B,C,D be the vertices (in order) of a 
quadrilateral inscribed in a circle. Then AD - BC+ AB-CD = AC. BD. 

8. Shoemaker’s Knife/Arbelos Let D be a region in the upper half-plane, 
whose boundary consists of three semicircles $;, S2 and $3 that meet at 
three points A, B and C on the x-axis. Then it is possible to fit a chain of 
circles into D between S; and $2, with the first circle touching $,, $2 and 
S3, and with each successive circle touching $;, S2 and the previous and 
following circles in the chain. 


6 Hyperbolic Geometry: 
the Poincare Model 


The book, Euclid’s Elements, which underlay most geometrical teaching in 
Western Europe for over 2000 years, gave definitions of the basic terms in 
geometry and rules (called postulates) for their use. Many of Euclid’s assump- 
tions seem entirely uncontroversial, such as the assertions that through any 
two distinct points in a plane or in space there passes a unique line, and that 
a line can be extended indefinitely in both directions. On these foundations 
Euclid gave rigorous proofs of theorems in elementary geometry, which could 
be accepted as true because of the way they had been established. 

Among the postulates for Euclidean geometry is one about parallel lines 
which is equivalent to the following statement. 


The (Euclidean) Parallel Postulate Given any line ¢ and a point P not 
on £, there is a unique line m in the same plane as P and ¢ which passes 
through P and does not meet @. 


Note that the parallel postulate makes two assertions: first that the parallel 
line exists, and second that it is unique. 

Historically it was felt that the Parallel Postulate as it stands is not obvious. 
It implies that every line through P other than m eventually meets @, but plainly 
this meeting place can be a long way away. How much confidence would you 
place in an assumption about lines meeting somewhere in the Virgo star cluster, 
some 100 million light years away? 

Many Greek, Arab and later Western geometers had felt that the answer was 
‘not much’, and had tried to delete the Euclidean Parallel Postulate from the 
list and instead derive it as a theorem. They all failed, and in the end the reason 
was laid bare: the Parallel Postulate cannot be made into a theorem in this way 
because there are internally consistent models of geometries which obey all 
the Euclidean postulates except the Parallel Postulate. 

But how do we obtain geometries other than Euclidean geometry with its 
Parallel Postulate? One way to do this might be to insist that any two lines 
intersect. The geometry to which this gives rise is called Elliptic Geometry. 
One model of elliptic geometry is Spherical Geometry (for example, on the 
surface of the Earth), and we shall investigate this model in Chapter 7. 


great circle 


m 


N 
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A ‘line’ PQ on the surface of a sphere (which we call S*), such as on the 

Earth’s surface, can be defined to be the great circle (that is, the intersection 

of the plane through the centre O of the sphere with the surface of the sphere) 

through the two points P and Q. Any two distinct great circles meet in two 

(diametrically opposite) points, such as P and P’ shown in the margin, so any P 

two lines in spherical geometry meet in two points. ed 7 
Generalizing this fact, in Elliptic Geometry we keep all the other postulates . 

that collectively describe Euclidean geometry, remove the Parallel Postulate, 

and replace it with the following analogue. 


The (Elliptic) Parallel Postulate Given any line @ and a point P not on 
£, all lines through P meet ¢. (That is, there are no lines through P that are 
parallel to @.) 


Another way to obtain a non-Euclidean geometry is to insist that the line 
through P that does not meet @ is not unique. We could again keep all the other 
postulates that collectively describe Euclidean geometry, remove the Parallel 
Postulate, and replace it with the following. 


The (Hyperbolic) Parallel Postulate Given any line @ and a point P not 
on £, there are at least two lines m through P that do not meet @. (That is, L 
there are at least two lines through P that are parallel to ¢.) 


The geometry whose postulates are those of Euclidean geometry but with In fact there are still other 
this variant of the Euclidean parallel postulate is called Hyperbolic Geome- geometries (and other 
try. The term non-Euclidean Geometry is often used to describe elliptic and —'™0dels of both hyperbolic 
hyperbolic geometries together. cae Bar be eine 

Non-Euclidean geometry was discovered independently in the late 1820s 6 this book. 

by the distinguished Russian mathematician Nicolai Ivanovich Lobachevskii 
(1792-1856), working at the University of Kazan, and the Hungarian Janos 
Bolyai (1802-1860), an Army officer whose father, Farkas (or Wolfgang) 
(1775-1856), was also a mathematician. Many of the same ideas were also 
known to Carl Friedrich Gauss (1777-1855), but he did not publish them. 
However, non-Euclidean geometry was not accepted until after these three 
were dead, when the German Bernhard Riemann (1826-1866) and the Ital- 
ian Eugenio Beltrami (1835-1900) published their ideas about geometry in the 
late 1860s. Riemann gave the appropriate general setting, Beltrami a specific 
account of non-Euclidean geometry. Only then did the new geometry become 
rapidly accepted. Amongst other things, it spurred Felix Klein (1849-1925) to 
propose his view of geometry. 

Non-Euclidean geometry was one of the most momentous mathematical dis- 
coveries of the 19th century. It had several revolutionary implications, because 
it provided a physically plausible description of space that differed markedly 


Hyperbolic Geometry: the Disc Model 


from Euclid’s. It became possible to imagine that the universe is not Euclidean, 
and in the 19th century some mathematicians and astronomers, such as Gauss 
and his friend Bessel), entertained the idea quite seriously. 

The theorems in non-Euclidean geometries often differ markedly from their 
Euclidean equivalents. For example, in Euclidean geometry the angles of any 
triangle sum to z; in elliptic geometry the angle sum of any triangle turns out 
to be strictly greater than 2, whereas in hyperbolic geometry the angle sum of 
any triangle turns out to be strictly less than z. 

In this chapter we shall concentrate on a model of hyperbolic geometry due 
to the French mathematician Henri Poincaré. In this model (which we will 
generally call ‘hyperbolic geometry’ for simplicity), the space of points is 
the interior of the unit disc Y = {z: |z| < 1}. All figures in this geometry 
will be drawn as they appear in this disc. In Section 6.1 we define hyperbolic 
lines and hyperbolic angles. We then introduce hyperbolic reflections in these 
lines, and obtain the group of hyperbolic transformations: a hyperbolic trans- 
formation is a composite of hyperbolic reflections. In Section 6.2 we see that 
every hyperbolic transformation can be described as either a Mobius transfor- 
mation (of a certain form) or a Mébius transformation composed with complex 
conjugation. 

The properties of hyperbolic geometry include angle and length, and so we 
can define hyperbolic circles, which we do in Section 6.3. We also obtain a 
formula relating Euclidean lengths and hyperbolic lengths (at least in special 
cases), and in Section 6.4 we give useful formulas for the study of hyper- 
bolic triangles. In Section 6.5 we look at area in hyperbolic geometry, and at 
hyperbolic tilings, or tessellations, such as the one displayed on the cover of 
this book. Finally, in Section 6.6 we briefly discuss the half-plane model of 
hyperbolic geometry. 


6.1 Hyperbolic Geometry: the Disc Model 


6.1.1 


The points or d-points (where ‘d’ stands for ‘disc’) in (Poincaré’s version of) 
hyperbolic geometry consist of the points in the unit disc 


What is Hyperbolic Geometry? 


9 =e: |e <= [oy +y? <I}. 
In sketches we usually illustrate Y by drawing in its boundary 
@={z:|l=Y= {@,y) 22+y? = i}, 


but you should note that points on @ are not points which belong to the 
geometry. 

A crucial feature of this geometry is the concept of a d-line, or a line in 
hyperbolic geometry. 
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Friedrich Wilhelm Bessel 
(1784-1846) was a 
German mathematician 
and astronomer, after 
whom Bessel functions 
are named. 


You met Mobius 
transformations in 
Subsection 5.3.3. 


Note that we shall use the 
alternative notations z and 
(x, y) freely to describe 
points in Y, according to 
convenience. 
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Definition A d-line is that part of a (Euclidean) generalized circle which 
meets @ at right angles and which lies in J. 


Every d-line is part of a generalized (Euclidean) circle which meets the 
boundary circle in two points. We call these two points the boundary points 
of the d-line. Notice that the boundary points of a d-line are not d-points — for 
they are not points in J. 

The sketch below illustrates various d-lines. Some of these d-lines are arcs 
of (Euclidean) circles; some are (Euclidean) line segments, in fact, diameters 
of J. 


a diameter 


7 


boundary points 


In fact, any d-line that is part of a Euclidean line must be a diameter of J, 
since otherwise it cannot meet @ at right angles. 

Conversely, any d-line @ that is an arc of a Euclidean circle cannot pass 
through the origin. For if 2 meets @ at a point P, then the tangent ¢’ to £ at P 
is a radius of @ and so passes through the origin O, which is the centre of @. 
But then @’ is a line touching the Euclidean circle € at P, and because circles 
lie entirely on one side of their tangents, ¢’ cannot meet ¢ again. Thus @ cannot 
pass through O. 

Indeed, if @ is a d-line that is an arc of a Euclidean circle, then there is a 
diameter of F that does not meet it. For, if € meets @ at the points P and 
Q, then the tangents to @ at P and Q are radii of @ and so pass through 
the origin. They divide the disc Y into four regions, one of which con- 
tains ¢. Clearly, a diameter can be drawn that lies entirely in two of the other 
regions — in fact, you can see that there are infinitely many such diameters. 
Each of these divides Y into two regions, one which contains @ and one which 
does not. 

Notice also that if a d-line @ is part of a Euclidean circle meeting @ at points 
P and Q, then its centre is the point R where the tangents to @ at P and Q 
meet. So the point R lies outside the circle @. 


A generalized circle is 
either a circle or a 
(Euclidean) line; see 
Subsection 5.2.3. 


You may find the 
following figure a helpful 
reminder for the shapes of 
d-lines: 


O 
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Sometimes it is clear from a (reasonably accurate) sketch whether or not 
part of a generalized circle is a d-line. For example, consider 


l= [x.y rx? +y?-4y =0] 0, 
Since 
xt y?—4y =06 x74 (y—2) =4, 
we can sketch £, and deduce that it is not a d-line since it clearly does not meet 


@ right angles. (The tangents to @ at its boundary points on @ do not pass 
through the origin.) 


Problem 1 Sketch the following parts of generalized circles, and 
determine which of them are d-lines. 


£p={(x,y): y=3x}NG 
fg ={(x,y):3x+y=1}NG 
= [@.y):2? +9? +2x42y+1=01N9 
Now, it is rather easy to be mislead by a sketch into believing that something 


is true in general whereas it is not! So, we now give an analytic criterion for 
identifying d-lines. 


Lemma 1 The equation of a d-line ¢ is of one of the following forms: 
ax+by=0, where a and b are not both zero; 


go eye hestey ol 10: where f* + g* > 4. 


Proof If ad-line @ is (part of) a Euclidean line through the origin, then cer- 
tainly its equation is of the form ax + by = 0, where a and b are not both 
zero. 

The other possibility is that the d-line ¢ is (part of) a Euclidean circle C that 
intersects the boundary @ of F at right angles. So, let C have equation 


eo + fe+ gy+h =0, for some real numbers f, g andh. 


First, it follows from the Orthogonality Test that any circle with this equation 
intersects the unit circle @ with equation 


x+4+y-1=0 
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7 (0, 2) 


Subsection 1.1.2, 
Theorem 3 
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if and only if 
f-0+g-0=2-(h+(—1)); 
that is, if and only if h = 1. So the equation of C must be of the form 


Ey +fx+gy+1= 0, for some f and g. 
2 
Next, the centre (-3/.-48) of C must be outside @, so that (-3 ) + 


2 
(- 5 8) > 1, which we can equivalently write in the form f* + g? > 4. 
Finally, we need the circle C to meet Y. This happens if and only if 


1+ radius of C > distance of the centre of C from O; 


since h = | we can write this requirement in the form 


1 1 1 1 
1 , 2 2 1 2 2 2 1 
Sy ar fae (1) 


Since we already know that ,/ 7 L p24 24 5g? > 1, the inequality (1) follows at 


once if we can prove that 
1+ JVt—1> Vt, whenevert > 1. (2) Here we are writing f in 
So, let F(t) = 1+ V?—1 — Vi. Then Bboy hae 
F()=14+V0-Vvi 


= 0, 

and 
F(t) = 1 1 
“Sjpul Qt 
_vi- i 5 
=) oe 

It follows at once that F(t) > F(1) = 0 fort > 1, and so that the inequality 
(2) holds, as required. | 
Thus, for example, let C be the circle in R? with equation In fact, C is the circle with 


x2 4 y? 3e=— 3p 4 SO, centre (-3, 1) and radius 
and let ¢ be the part of C that lies in Y. Then ¢ is a d-line since its equation is (- 3) 4-2 1 
of the form required by Lemma 1, with f = 3 and g = —2, and f* + g? 
3? 4 (-2)? = 13 > 4. 

Note the following useful result that we obtain from putting a= — 5f, 
b= —38 into Lemma 1. 


Corollary 1 Let a = a+ ib be a point outside the unit circle @. Then the 
d-line that is part of a Euclidean circle with centre w has equation 


ee le 
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We can now turn to the question of parallelism, the notion that gave rise to 
the discovery of non-Euclidean geometries in the first place. 


Definition Two d-lines that do not meet in Y are: Recall that the boundary 
parallel if the generalized Euclidean circles of which they are parts meet at_ © #810" part of the unit 
a point on @; disc 2. 

ultra-parallel if the generalized Euclidean circles of which they are parts 

do not meet on @. 


Remarks 


1. It follows that given any d-line @ and any point P in ZY which is not on £, 
there exist exactly two d-lines through P which are parallel to @, as the next 
figure shows. A rigorous proof that such d-lines exist will be found below 
(Problem 5). 


2. Similarly, corresponding to any given d-line @ and any point P in Z which 
is not on £, there exist infinitely many d-lines through P which are ultra- 
parallel to €. 


Problem 2 The following sets represent d-lines: 
f)={@, y): y=x}NF,; 
(y= {@, y) ix? +y?- 4x +1=0} 9; 
3 = {(x,y) ix? +y? -2V2x+1=0}N9; 
lg ={Q,y) x? +y? 42x +2y+1=0} NF. 


Sketch them, and decide from your figure which of the d-lines intersect 
each other, which are parallel, and which are ultra-parallel. 
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Problem 3 Sketch three d-lines ¢;, €2 and €3 with the property that 
£1 is parallel to £2, and @2 is parallel to £3, but £; is not parallel to £3. 


In Euclidean geometry and in inversive geometry, reflection and inversion 
played important roles; this is also the case in hyperbolic geometry. Obviously, 
reflection of the unit disc Z in a diameter maps ZY onto itself; in fact, so does 
inversion of Y in a d-line that does not pass through the origin. 


Theorem 1 Let @ be a d-line that is part of a Euclidean circle C. Then 
inversion in C maps @ onto @, and Y onto J. 


Proof Let C meet @ at the points A and B. 

Under inversion in C, A and B map to themselves. Since inversion preserves 
angles, the circle @ maps onto some circle that meets C at right angles at the 
points A and B. There is only one such circle, @ itself; it follows that inversion 
in C maps @ onto itself. 

Hence the inside Y of @ must map either to the inside or to the outside of 
the image of @, namely @ itself. But the points of € map to themselves under 
this inversion, so the image of Y must be F itself. |_| 


We now know that inversion in a d-line maps the disc Y onto itself. We 
also know that the composition of an inversion with itself is the identity map. 
Because the analogous properties are true of Euclidean reflections, we make 
the following definition. 


Definition A hyperbolic reflection in a d-line £ is the restriction to of 
the inversion in the generalized circle of which £ is part. 


Remarks 


1. If the d-line @ is part of a Euclidean circle C meeting the boundary circle 
@ at points P and Q, then the point R where the tangents to @ at P and Q 
meet is the centre of the inversion. 

2. Notice that Theorem | may be reformulated as follows: hyperbolic reflec- 
tion in any d-line maps the unit disc J onto itself. 


Hyperbolic reflections are the building blocks of transformations in hyper- 
bolic geometry. Composites of a finite number of hyperbolic reflections are 
called hyperbolic transformations. The set of all such transformations under 
the operation of composition of functions forms a group, called the hyperbolic 
group, Gg. 


Definition Hyperbolic geometry consists of the unit disc, Y, together 
with the group Gg of hyperbolic transformations. 
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It follows that the relation 
‘is parallel to’ is not an 
equivalence relation in 
hyperbolic geometry. 


Subsection 5.1.3, 
Theorem 5 


Inversion in C maps the 
shorter arc AB of @ onto 
the longer arc AB of @, 
and vice versa. 


Inversion in C maps the 
region bounded by ¢ and 
the shorter arc AB of @ 
onto the region bounded 
by £ and the longer arc AB 
of @, and vice versa. 


Recall that we use the 
term ‘inversion’ to mean 
either reflection in a line 
or inversion in a circle 
(see Subsection 5.1.1, 
‘Convention’ ). 


A proof of the fact that 
these form a group will be 
found as an Exercise in 
Section 6.7; it consists 
simply of checking the 
axioms for a group. 
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Remarks 


_ 


. Notice, in particular, that the identity mapping of 7 to itself belongs to Gg, 
since it may be expressed as the finite composition ro r~! for any hyper- 
bolic reflection r. Also, if tj 0 f2 0...0f, is any hyperbolic transformation, 
then its inverse is another hyperbolic transformation ¢,- ee ty are ty iu 

. Let the generalized circle C of which a d-line £ is part meet the boundary 
circle @ in points P and Q, and let r be the hyperbolic reflection in @. Then 
although r has domain the unit disc Y, and P, OQ ¢ Y, we sometimes find it 
convenient to call P and Q the boundary points of £ and to write the images 
of P and Q under inversion in C (of which r is a restriction) as r(P) and 
r(Q). The same convention is extended to the case where r is a hyperbolic 
transformation. 


i) 


Next, we define the idea of angle in hyperbolic geometry. 


Definition The (hyperbolic) angle between two curves (for example, 
two d-lines) through a given point A in ZY is the Euclidean angle between 
their (Euclidean) tangents at A. 


Now, Euclidean reflections and inversions both preserve the magnitudes of 
angles. It follows that hyperbolic transformations also preserve magnitudes of 
angles. Also, Euclidean reflections and inversions map generalized circles onto 
generalized circles. Combining this with the angle-preservation property, we 
deduce that hyperbolic reflections and inversions map d-lines onto d-lines — 
and so compositions of a finite number of such transformations also have this 
property. We summarize these facts in the following result. 


Theorem 2 Hyperbolic transformations map d-lines onto d-lines, and 
preserve the magnitudes of angles. 


In the remainder of Chapter 6 we study the properties of various figures 
under hyperbolic transformations, and obtain surprising (and sometimes 
beautiful) results! 


6.1.2 Existence of d-lines 


Through a typical point A of Y there is at least one d-line: namely, the diameter 
through the origin O and A. But through the origin there are infinitely many 
d-lines: the diameters of Y. Is there more than one d-line through an arbitrary 
point A of J? 

The first step towards answering this question is the following useful result, 
which shows that there is a hyperbolic transformation which maps A to O. 


Lemma 2 Origin Lemma 
Let A be a point of Y other than the origin O. Then there exists a d-line @ 
such that hyperbolic reflection in £ maps A to O. 


e 
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Proof We seek a d-line £ which is part of a Euclidean circle with centre R, 
say, such that inversion in this circle maps A to O. Suppose that this circle 
meets @ at the point P. 

The condition that this inversion maps A to O is 


RO-RA = RP”, (3) 


since RP is a radius of the circle we seek. 
The condition that part of this circle is a d-line is that triangle ARPO is 
right-angled at P. By Pythagoras’ Theorem, this implies that 


RP? + PO? = RO”, 
which, since OP = 1, is equivalent to 
RP? = RO? — 1. (4) 
Eliminating the radius RP ? from equations (3) and (4), we deduce that 
RO- RA = RO* — 1. 


This is equivalent to 
RO? — RO-RA=1 
or 
RO .- (RO — RA) = 1. 
But RO — RA = AO, so we deduce that 
RO-.AO=1 


which is equivalent to 
OA-OR= 1. 
This tells us that the circle we seek has for its centre the point R which is 
found by inverting the given point A in the boundary circle @— an unexpectedly 
memorable result! a 


The great value of the Origin Lemma is that it enables us to study any prob- 
lem in hyperbolic geometry by mapping a suitably chosen point to the origin, 
thereby yielding a (frequently) simpler picture than before — yet without losing 
any generality. We shall use this method often in this chapter. 

We can, for example, use this approach to answer the question of how many 
d-lines pass through a given point of Z. 


Theorem 3 Let A be a point of Z. Then there exist infinitely many d-lines 
through A. 


Proof Let A be the origin. As we said above, each of the infinitely many 
diameters of Y passes through the origin, and each of these diameters is a 
d-line. 

If A is not the origin, then by the Origin Lemma, there is a hyperbolic trans- 
formation, r say, that maps A to the origin O, through which pass infinitely 
many d-lines — all the diameters of J. 
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e e e 
Since r~! is also a hyperbolic transformation, it follows that the images of 
these diameters are also d-lines — and they pass through A. a 


An important result in Euclidean geometry is that there is exactly one line 
through any two given points. There is an analogous result in hyperbolic 
geometry. 


Theorem 4 Let A and B be any two distinct points of Y. Then there exists 
a unique d-line @ through A and B. 


Proof (Existence) By the Origin Lemma, there is a hyperbolic transformation 
r that maps A to the origin O; let the image of B under r be the point B’ of J. 


e e 


Then there is a unique d-line ¢’ (a diameter of Y) that passes through O and 
B’. Since r~! is also a hyperbolic transformation, it follows that € = r—!(¢’) 
is also a d-line (by Theorem 2) — and it passes through A and B. 

(Uniqueness) Suppose that £; is another d-line through A and B. Then r(¢1) 
is ad-line that passes through O and B’. It follows that r(€,) = £’, so €; must 
be the same as r—!(€’) = €. This proves the uniqueness of the d-line through 
A and B. a 


Problem 4 Let A, and A) be any two points of Y. Use the Origin 
Lemma to prove that there is a hyperbolic transformation that maps A; 
to Ao. 


In fact, we can establish a result stronger than that of Problem 4: we can 
determine a hyperbolic transformation which maps A; to Az and maps any 
given d-line through A, to any given d-line through A2. 


Theorem 5 Let A; and A2 be any two points of Y, and let 2; and £2 
be d-lines through A, and Aj, respectively. Then there is a hyperbolic 
transformation which maps A, to A2 and £; to £2. 


For, hyperbolic 
transformations map 
d-lines to d-lines, by 
Theorem 2. 
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Proof By the Origin Lemma, there are hyperbolic transformations r; and r2 
which map A, and Ao, respectively, to the origin O. Let the images of ¢ 
and €2 under r; and rz be the d-lines a and bos respectively. Let r3 be the 
hyperbolic transformation which rotates Z about O so that €, maps onto £,. 


& So = 
— — ama 
ry r3 + 
e c) e e e 


r 


Now let r be the hyperbolic transformation given by 


r=r,\orzort. 


Then r maps A to A? and £; to £2, as required. | 


Remark 


We can choose the rotation r3 to map the two parts of — O onto the two parts 
of £, — O in whichever way we please. Hence we can arrange for r to map 
the two parts of 2; — A, onto the two parts of £2 — Az in whichever way we 
please. 

In the following example we show that parallel d-lines map to parallel 
d-lines under a hyperbolic transformation. 


Example 1 Let A, B and C be points of @ such that the d-lines AB and BC 
are parallel. Let r be a hyperbolic transformation under which the images of 
AB and BC are the d-lines A’ B’ and B’C’, respectively, where A’, B’ and C’ 
are points of @. Show that A’B’ and B’C’ are parallel d-lines. 


Solution First, we show that A’B’ and B’C’ do not meet in J. If they do, 
let them meet at the point P’, say. Then the point P’ corresponds to a point 
P, say, on the d-lines AB and BC; that is, r—l(P’) = P. But there is no such 
point, since AB and BC are parallel; it follows that A’ B’ and B’C’ do not meet 
in J. 

Next, the generalized circles of which A’B’ and B’C’ are parts meet at a 
point B’ on @. So we conclude that the images of A’ B’ and B/C’ are parallel 
d-lines. 


Problem 5 Show that, given a d-line and a point P not on @, there 
are exactly two d-lines through P which are parallel to @. 
Hint: Use the Origin Lemma. 
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We shall verify in 
Subsection 6.2.1 that any 
rotation of Y about O isa 
hyperbolic transformation. 


Note the convenient use of 
the label AB for a d-line 
even though A and B do 
not lie on that d-line (they 
lie on @). There is no 
ambiguity since there is 
only one circle through A 
and B with centre the 
point where the tangents 
to @ at A and B meet. 
Analogously, we shall say 
that a d-line passes 
through its boundary 
points. 
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6.1.3 Inversion Preserves Inverse Points 


Earlier, we proved the following result. Subsection 5.5.2, 


. . . . . Corollary 2 
Let A and B be inverse points with respect to a generalized circle C, and let t be 


an inversive transformation. Then f(A) and f(B) are inverse points with respect 
to the generalized circle t(C). 


We now consider a special case of this result, which we can interpret in 
terms of hyperbolic geometry. 

We take C to be a generalized circle of which part (a d-line £) lies inside the See the diagram below. 
unit disc Z. Let A be a point in Y; then its inverse with respect to ¢, B say, is 
also in Y. We take ¢ to be an inversion for which the generalized circle of inver- Every inversion is an 
sion C* is such that part (a d-line £*) lies inside Z. Then, by the above result, _inversive transformation. 
A’ = t(A) and B’ = ¢(B) are inverse points in Z with respect to the general- 
ized circle C’ = t(C), part of which lies in Y, namely the d-line ¢’ = r(£). 

We now interpret this special case in terms of hyperbolic geometry. 


Theorem 6 Let A,B € Y be inverse points with respect to the d-line ¢, 
and let A’, B’ and £’ be the images of A, B and £ under inversion (hyperbolic 
reflection) in another d-line @*. Then A’ and B’ are inverse points with 
respect to inversion in ¢’. 


Remark 


Note the convenient use of the term ‘inversion’ to mean ‘hyperbolic reflection’. 

This is consistent with the use of this term in Chapter 5. 
We shall make use of this theorem in Section 6.5, when we study the design 

on the cover of the book. Cc 
We finish this section with an illustration of the use of Theorem 6. 


Example 2 Let £ be ad-line and P be a point in Z. Let Q be the image of P 
under inversion in the circle C of which € is part. Let ¢’ be the d-line through 


P and Q, meeting @ at the point S. ge, 


(a) Show that inversion in C maps £’ to itself. 
(b) Deduce that the d-lines @ and ¢’ meet at right angles. e 
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(c) Use Theorems 5 and 6 to show that there is a hyperbolic transformation 
mapping the figure to one in which the d-line @ is the x-axis and P and Q 
are complex conjugates. 


Solution 


(a) Inversion in C exchanges P and Q. So it maps ¢’, the unique d-line 
through P and Q, to itself. 

(b) Inversion in C exchanges the angles ¢’ makes with £, and maps angles to 
angles of equal magnitude. So the angles must be equal, and since their 
sum is 7, each must be 7/2. Thus £ and ¢’ meet at right angles. 

(c) We use Theorem 5 to map the point S to O and the d-line @ to the x-axis. 
This makes the d-line ¢’ part of the y-axis. By Theorem 6, P and Q are 
mapped to points that are inverse with respect to the x-axis, and so are 
complex conjugates of each other. 


6.2 Hyperbolic Transformations 


6.2.1 Hyperbolic Transformations and Mobius Transformations 


Each element of the group Gg of hyperbolic transformations is the composite 
of a finite number of reflections in d-lines. In this subsection we shall estab- 
lish an explicit formula for a hyperbolic transformation in terms of M6ébius — Subsection 5.3.3 
transformations. 
We start by considering reflection in a d-line £ which is part of a circle C 
with centre the point R and radius r. Suppose that the d-line has boundary 
points P and Q. We let the coordinates of the point R be (a,b), and write the 
complex number a + ib as a, so that |a| > 1 (which follows because ¢ is a 
d-line). We shall need the following observation: because the d-line € is part 
of the circle with centre a and radius r, the triangle ARPO is right angled at — We shall call this d-line 


P. So, by Pythagoras’ Theorem, RP? + PO? = RO*. This implies that the ‘d-line obtained 
from @’. 


r+1=a? +b’. 
Using the fact that a? + b? = ad, we may rewrite this equation as 
r*— oa = —1, (1) 


We now use equation (1), and the fact that inversion ¢ in the circle C has 
the form 


2 
r 


t(Z) = +a (eC—{a}), 


This follows from 
Subsection 5.2.1, 
to obtain the form of the hyperbolic reflection p in the d-line @. Theorem 3. 


NX 


=a 
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Since @ is part of C, p is given by 


p(z) 


= (Z € Z), 
a 


by equation (1). Thus we have proved the following lemma. 


Lemma 1 The hyperbolic reflection p in the d-line @ that is part of a 
Euclidean circle with centre a is given by the hyperbolic transformation 
Og) = We = (ze QJ). Recall that |a| > 1. 
= 


Notice that we may write 


p(z) = (M o B)(), forz € J, 


where M(z) = = is a Mobius transformation, and B(z) = z is complex 
conjugation. You know from Chapter 5 that Mébius transformations are impor- Subsection 5.3.3, 
tant in inversive geometry: they preserve the magnitudes of angles, and they Theorem 5 

map generalized circles to generalized circles. Since angles and generalized 

circles are important in hyperbolic geometry, it is not surprising that particular 


sorts of Mébius transformations turn up here too. 


Problem 1 Find the point which has image 0 under reflection in the 
d-line obtained from a. Hence obtain a second proof of the Origin 
Lemma. 


Reflection in a d-line which is a diameter of Z is simply (Euclidean) 
reflection in that line. Recall that Subsection 5.2.1 


t(zZ) =Z 
is reflection in the x-axis, and 
1y(z) = az, 


where a = cos @ +i sin@ with 6 = Arg a, is rotation about the origin through 
the angle 6. It follows that the composite 


(1 of, 0 s") (z) = a(a@Z) = a2 For ty !(z) = az 


is reflection in the line with equation y = x tan@. 
Thus hyperbolic reflection o in the diameter on which y = x tan@ is 
given by 
o@=azZ (EY), 


where ~ = cos@ +i sin0@. 
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Example1 Find the composite 02 0 oj of the hyperbolic reflections 
oi(z) =a7Z and o2(z) = Bz, 


where a= cos@; + isin@; and B= cos@ + isin@2, and interpret the 
transformation o2 o o; geometrically. 


Solution We have 


(02 0 01)(z) = B*(a2z) = Baz, 


which is a Euclidean rotation about the origin of J. 


The composite of reflections o and o in two d-lines that are obtained from 
a and £, respectively, is found as follows. By Lemma 1, we have 


p(z) = ia =o = (M o B)(z) Recall that M(z) = ml 
Z-a and B(z) =z. 
and 
z—1 
o@= 2 7 = (Mo BG), 


where M’(z) = B = is a Mébius transformation. So 


(oo p)(z) = (M’0 BoM o B)(z). 
But 


ei | x, 
(B o M)(z) = ~— = (Mo B)(2), 
Z—-a 


where M (z) is the M6bius transformation M (= ot So 
(og 0 p)(z) = (M0 BoM o B)(z) 
= (Mo Mo Bo B)(z) = (Mo M)(z), 
because (B o B)(z) = z. 
Now matrices associated with M’ and M are ( B "3 ) and ( aie ), 


1 1 -a 
respectively. Hence a matrix associated with the composite M’ o M is Here we are using the 
7 7 strategy in Subsection 
B = a —-l\_ /( ap =1 oe B 5.3.5 for finding 
1 -£ 1 -a J) \ @a-Bp ap-1 /’ composites. 
so that the composite o o p of the hyperbolic reflections o and p is of the form 
az+b 
(9 0 p)Z@) = =—_, 
bz+a 


wherea = af —1,b=a-— 68. 
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For ease of reference, we record this last result as a theorem. 


Theorem 1 The composite of the hyperbolic reflections 


z—1 z—1 
p= %— and o@= 2 Ge” 
1b Os Z— 
is the hyperbolic transformation 
(@/3 = Ngar@ = (8 
= = = J). 
Cope) = = CED) 


Example 2 Show that the hyperbolic reflection p is its own inverse, by 
showing that p(p(z)) = z. 


Solution From Theorem 1, we know that the composite of the reflection p 
with itself is given by 
(aa —l)z+a-a _ (aa — 1)z _ 


(0° p)(z) = @=ueieect wal = 2; aa #1 since |a| > 1. 


as required. 


Example 3 Show that the composite Mz o M, of the Mébius transformations 


az+b cz +d 
M\(z) = =—— and Moz) = =—— 
bz+4 dz+c 
is a Mobius transformation of the same form. That is, of the form Fae 


: : . : : for some e, f € C. 
Solution By the strategy for finding composites of Mébius transformations, 


a matrix associated with the composite Mz o Mj is the matrix product 


c od ab\ | cat+tdb cbh+da 
d é b a) \ da+éb db+éa )° 


This product is of the required form, because 


(ca+db)=db+éa and (ch+da) =da+éb. 


It follows from this example that every composite of an even number of 
hyperbolic reflections in d-lines can be represented as a Mébius transforma- 
tion, restricted to Z, of the form 

M@= 272, ce”. 
bz+a 

Similarly, it can be shown that every composite of an odd number of reflections 
can be represented as a Mobius transformation of this form composed with We omit the details of a 
complex conjugation. proof of this. 


360 


It follows from the above discussion that any composite of a finite number of 


hyperbolic reflections, that is, any hyperbolic transformation, can be expressed 
in one of the forms 


zt» M(z) or ze MZ) (z<EY), 
where M is the Mobius transformation 
az+b 
M(z) = = -, zEQ. 
bz+a 


Moreover, since M(0) = b/a and the image of 0 under a hyperbolic transfor- 
mation must be in Y, we require that the above Mobius transformation must 
be such that 

|b| < Jal. 
The remaining question is: “Do all such Mobius transformations represent 
hyperbolic transformations?’ In fact the answer is YES, as the following 
theorem shows. 


Theorem 2 The restriction to Y of every Mobius transformation of 


the form M(z) = goth with |b| < |a| is a composite of two hyperbolic 


“ 
reflections, and is therefore a hyperbolic transformation. 


The proof is a little devious (and no less elegant for that). We first prove it 
for the special case when the Mobius transformation maps the origin to itself. 
Using this special case, we then prove the general case. 


Proof 
Case 1 The Mobius transformation M(z) = ate 
We shall show that M is the composite of two reflections. 
The condition M(0) = 0 implies that b = 0, so that the Mobius transfor- 
mation is simply M(z) = £z — a rotation about the origin. 


Now, if we let a = re'®, then 


maps the origin to itself. 


= ele 


Qi/ & 


So, if we let o; and 02 be the reflections in diameters of Z given by 


oi(z)=z, zeQ, and on(z)=e%%z, zeQ, 
then 

0 ro) 01 (Zz) = e79(Z) = e2lFz. 
so that M = o2 0 oj, as required. 


The Mobius transformation M(z) = az+P does not map the origin 


Case 2 
bz+a 


to itself. 
We shall again show that M is the composite of two hyperbolic reflections. 
Consider a hyperbolic transformation p given by p(z) = (Mo B)(z), where 
M'(z) = — is a Mobius transformation, and B(z) = z is complex conjuga- 
tion. We shall choose @ so as to make p and M o ¢ hyperbolic reflections. This 


will show that M = (Mop)op~! is acomposite of two hyperbolic reflections. 
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When the context 1s clear, 
we often omit to say that 
the Mobius transformation 
we are considering is 
restricted to J. 


Recall that reflection in 
the diameter y = x tan0 
of J is a mapping of the 
form z > az, where 

a =e!9 = cos6 +isind 
andzeé YJ. 


Hyperbolic Transformations 


Since M and M’ are associated with the matrices 


a b a —-l 
( b a ) and ( t <3 ). 


respectively, the composite M o M’ is associated with the matrix 


(5 a -l je eae SS 

b a 1 -a@ ab+a —b—aa : 

so we deduce that 

(aa + b)z — (ab +a) 

(ab + a)% —b — aa 

Now since M(0) £0, it follows that b #0. Hence we may choose a= 
— (a/b), so that 


(M o M'o B)(z) = 


ab+a=—(a/b)b+a=0. 


— lal 


Next, since |b| < |a|, we deduce that |a| = [Bl > 1, and so the transformation 
p = M' o B isa hyperbolic reflection as required. Moreover, 
(aa + b)z 


(M 0 p)(z) = (M o M’ 0 B)(z) = — 


=0 => aa 


is of the form = 5%, where y = aa + b, and so the transformation M o p is 
a hyperbolic reflection. It follows that the transformation M is a composite of 
two hyperbolic reflections, M o p and p7!. | 


A similar argument shows that the more general transformation 

az+b 

bz+a 

is a composite of three reflections. For it is the composite (M o B)(z) of the 
Mo6bius transformation 


Zk 


with complex conjugation B(z) = z. Now, the Mobius transformation M is a 
composite of two reflections, and complex conjugation B is reflection in the 
x-axis. So M o B is acomposite of three reflections, as claimed. 

Putting together all the above results, we have shown the following. 


Theorem 3 Every hyperbolic transformation can be written as a compos- 
ite of at most three hyperbolic reflections. 


Example 4 Show that the composite a2 o 0; of the two reflections in lines 
through the origin given by 0; (z) = az and o2(z) = f7Z can be written in 
the form M(z) = #242. 


bz+a 


For, ¢ is of the form 
required by Lemma 1. 
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Solution From Example 1, we know that 
(02 0 01)(z) = Braz. 


Since |a| = 1 and |6| = 1, 


and so 
(02 001)(z) = M(z), 
where 
b 
M(z) = a ~ witha = Ba andb = 0. 
Z 


be ha 


Problem2 Let M(z) = gett be a hyperbolic transformation mapping 
the origin to itself. Show that b = 0. 


Hyperbolic Rotations and Translations 


It follows from the Angle Theorem of inversive geometry that a single hyper- 
bolic reflection reverses the orientation of angles between d-lines. So a 
composite of two such transformations leaves the orientation unchanged, while 
a composite of three reverses it again. We call a hyperbolic transformation that 
leaves orientation unchanged a direct hyperbolic transformation, and one that 
reverses orientation indirect. So, by Theorem 3 above, we deduce that 


a direct hyperbolic transformation can be written as a 
composite of at most two hyperbolic reflections 


and 


an indirect hyperbolic transformation can be written as a 
composite of at most three hyperbolic reflections. 


It is possible to say more about the direct transformations. Let r; and rz be 
reflections in the d-lines €; and £2, respectively. 

First, suppose that €; and 2 intersect at some point A. (Certainly they 
cannot intersect at more than one point if they are distinct d-lines, since by 
Theorem 4 of Subsection 6.1.2 there is a unique d-line through any two points 
of Y.) Then the composition r2 o r; leaves the point A fixed, moves the points 
of Y ‘around A’, and does not alter the orientation of ZY. Such a hyperbolic 
transformation is called a (hyperbolic) rotation, and has exactly one fixed 
point in J. 

Next, suppose that the d-lines €; and £2 are parallel, that is, the generalized 
circles of which they are parts do not meet in Z but do meet at some point P 
on @. Then the composite rz o r; moves the points of ZY ‘around P’, and does 
not alter the orientation of Y; it has no fixed point in Y, but it leaves all the 
parallel lines with P as their common boundary point as parallel lines ending 


Subsection 5.1.3, 
Theorem 5 


Theorem | gives the form 
of a composite of two 
reflections. 


Hyperbolic Transformations 


at P. We can regard this as the limiting case of a rotation (about a point of @), 
and so call it a (hyperbolic) limit rotation. 

Finally, suppose that the d-lines €; and £2 are ultra-parallel, that is, the 
generalized circles of which they are parts do not meet in Y or on @. Then the 
composite r2 o 7; moves all the points of Z in one general direction, and does 
not alter the orientation of Y; but no point of Z (or @) remains fixed. Such a 
hyperbolic transformation is called a (hyperbolic) translation. 

The analogy between Euclidean geometry and hyperbolic geometry is not 
exact, however. For instance, the composite of two Euclidean translations is 
independent of the order in which they are applied, whereas the composite of 
two hyperbolic translations may not be independent of the order in which they 
are applied. 


6.2.2 The Canonical Form of a Hyperbolic Transformation 


In this subsection we show how to write a hyperbolic transformation in the 
most suitable form for applications. We shall do this only for a direct transfor- 
mation of the form M(z) = gett , where |b] < |a|. Our result is the following 
theorem. 


Theorem 4 A direct hyperbolic transformation M can be written in the 


form 
Z—m 
WK) = IK —, 
1—mz 


where K and m are complex numbers with |K| = 1 andm é€ J. 


Remark 

We call this form the canonical form of a direct hyperbolic transformation. It 
has the great advantage of showing that the transformation M maps the point 
m to the origin. 


Proof We know from the previous subsection that a direct hyperbolic 
transformation can always be written in the form 
az+b 


M(z)== —, with |b] < fal. 
bz+a 


Indeed, on dividing the expression for M(z) above and below by a, we can 


b —h 
Wga2 2 Zee). 
1-+ a 1- + 


which is of the required form, with K = a/a andm = —b/a. Since |a| = |a|, 
it follows that |K| = 1, as required. Since |b| < |a|, it follows that |m| < 1, as 


write it as 


required. | 
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See the discussion before 
Theorem 2. 
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We can now find the form of every direct hyperbolic transformation that 
maps a point m of J to the origin. We know from Theorem 4 that it must be 
of the form 


M(z) =K~——., where |K| = 1 and |m'| <1. 
1—m’z 

The condition that M(m) = 0 implies that m = m’, so we deduce that in fact 
every direct hyperbolic transformation mapping the point m to the origin is of 
the form 

—m 


M(z)=K —., 
1—mz 

The direct hyperbolic transformations that map the origin to itself are there- 
fore those for which m = 0, so they are of the form M(z) = Kz with |K| = 1. 
These are (Euclidean) rotations of the disc Z about the origin through an angle 
0, where K = cos@ +i sin@. In general, if all that is required is just one direct 
hyperbolic transformation sending a given point m to the origin, then we may 


set K = 1, and use the transformation M(z) = #52. K =1is the most 


convenient value to 
choose. 


Example 5 


(a) Find the general form of a direct hyperbolic transformation that maps the 
point xi to the origin. 
(b) Find one direct hyperbolic transformation that maps 3 to the origin. 


Solution 


(a) The general form is 


1. 
Z— 51 2z—i 
M(z) = K ————— = K —., where |K| = 1. 
1-(-si)2 e+? 
2 
(b) Taking K = 1 in the formula of Theorem 4, as suggested above, one such 
transformation is 


M(z) = 


Problem 3 Determine all the direct hyperbolic transformations that 
map the origin to the origin and the line y = x//3 to the line y = /3x. 


Problem 4 For each of the following points, determine all the direct 
hyperbolic transformations that map it to the origin. 
(a) gi (b) —3 + 3 


Hyperbolic Transformations 


The general form of the inverse of a direct hyperbolic transformation M 
mapping the point m to the origin is a direct hyperbolic transformation sending 
the origin to m. To find this inverse explicitly, we write 


M(z) =K~=— = (by Theorem 4) 
1—mz 
= Kz—Km 
~ -mz+1’ 


and make use of the fact that M is the restriction to Z of a Mobius 
transformation to write 
z+Km 


eg ea 
@O=FaK 


(2) 
where |K| = 1. 


Problem 5 Determine the general form of the inverse of the direct 
hyperbolic transformation M which maps i to the origin. 


Problem 6 Prove that any direct hyperbolic transformation M that 
maps the diameter (—1, 1) onto itself must be of the form 


x= 


M(z)=¢4 ; where m € (—1, 1). 


~l—mz 

Since an indirect hyperbolic transformation can be written as the compos- 

ite of at most three reflections, it follows that the general form of such a 
transformation is 


An argument similar to that for direct transformations shows that this 
transformation can always be written in the form 
Z—-m 


—= 9 


— mz 


Zw K where |k| = | and |m| < 1. 


We call this form the canonical form of an indirect hyperbolic transfor- 
mation. 

Theorem 4 gives the general form of a direct hyperbolic transformation that 
maps a given point m in Y to the origin. By two applications of the theorem, 
therefore, we can obtain the general form of a direct hyperbolic transformation 
that maps any given point in J to any other given point in J. 


Strategy To determine the general form of the direct hyperbolic transfor- 
mation that maps one point p in Y to another point qg in F: 


1. write down the general form of the direct transformation Mj, that maps 
p to O, and a matrix A, associated with M); 

2. write down a direct transformation M> that maps q to 0, and a matrix Az 
associated with M2; 
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By ‘restriction to Z’ we 
mean that we only 
consider those z in J. 


This form for M~! 
follows immediately from 
Subsection 5.3.6, 
Theorem 7. 


See the discussion 
preceding Theorem 2. 


If we require a particular 
direct transformation that 
maps p to q, then we may 
choose a particular My, 
and a particular Ay. 


The constant arising in 
Step 1 means that it is 
sufficient to use any 
particular transformation 
in Step 2, since the final 
general transformation 
requires only 

one constant. 
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3. form the matrix product AS Ay associated with the direct transformation 
M, ' M,, and hence write down the general form of the required direct 
transformation M, ‘om ile 


The following figure illustrates this strategy. 


— — 
M, M, 
M3! 0M, 


Example 6 ee the general form of the direct hyperbolic transforma- 
tion that maps 5 Li to? 


Solution We have already — that the general form of the direct hyperbolic | Example 5, part (a). 

transformation M, that maps 5 +i to Ois 

22 — 
z+2’ 


M(z)=K 


a matrix associated with M, is 


2K —-iKk 
ee a) 


Also, the direct transformation 


where |K| = 1; 


M2(z) = 


Example 5, part (b). 


maps 7 3 to 0; a matrix associated with M2 is 


The inverse of Ap is 
1 3 
=] — 16 


So a matrix associated with the sae direct transformation is 


mime CEA) 


4({ 8K+3i —4iK +6 
6K+4i -3iK +8 
Hence any direct transformation that maps 5 17 tg 2 z may be written in the form The fraction 7 disappears 
(8K + 3i)z + (-4iK + 6) at this point, as it is 


M(z) = - - , where |K| = 1. impl Itiple of th 
(OF CRAs + CHK 48) * whole matix. 


Distance in Hyperbolic Geometry 


Problem 7 


(a) Determine the general form of the direct hyperbolic transformation 
that maps —ti to . 
(b) Determine the direct hyperbolic transformation that also maps i to 1. 


Problem 8 Determine the general form of the direct hyperbolic 
transformation that maps 5 to 5: 


Problem 9 For each of the following pairs of points, either find a 
direct hyperbolic transformation mapping the first point to 0 and the 
second point to 5. or prove that no such transformation exists. 


(a) Ai andO = (b) 4 and? (c) 41 +) and 4(1 — i). 


6.3 Distance in Hyperbolic Geometry 


In the Euclidean geometry of the plane, corresponding to any two points there 
exists a non-negative number called the distance between the points. This is 
given by the formula 


d(z1,22) =|z1—z2|, 71,22 €C. 


In this section, we introduce an analogous formula for the distance in 
hyperbolic geometry between any two points of the unit disc J. 


6.3.1 The Distance Formula 


We begin by looking at various properties that we would expect any distance 
function d to have in any geometry whose points lie in the complex plane C. 
‘Ordinary’ Euclidean distance, for example, clearly possesses the following 
four properties. 


Properties of a Distance Function d 


1. d(z1,22) = 0 for all z; and z; 
d(z1, 22) = O if and only if zy = zo. 

2. d(z1, 22) = d(z2, 21) for all z; and z2. 

. d(Z1, 23) + d(z3, 22) = d(z1, 22) for all z1, z2 and z3. 

4. d(z1,23) + d(z3,22) = d(z1, 22) if and only if z1,z3 and 22 lie in this 
order on a line. 


Oo 


Property 1 asserts that the distance between any two points in the geometry 
is always positive, unless the two points coincide — in which case the distance 
between them must be zero. 

Property 2 asserts that the distance from z ; to z2 is the same as the distance 
from z> to z). 
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These functions fix the 
ra | 
point 5. 


So d(z1,22) = d(z2, 21). 


Property 3 is known as the 
Triangle Inequality. 
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Property 3 asserts that the distance from z, to z2 is always less than (or equal <3 
to) the distance from z; to another point z3 plus the distance from z3 to z2. In 
Euclidean geometry this property may be rewritten in the form 


lz1 — 23| + |z3 — z2| = |z1 — zal. 


Property 4 asserts that distance along a line is additive. 
However, there are some additional properties that we wish the distance, 
function in hyperbolic geometry to possess. 


Additional Properties of the Distance Function d in Hyperbolic 
Geometry 


5. d(z1, 22) = d(Z1, Z2) for all z, and z2 in F. 
6. d(z1,z2) = d(M(z), M(z2)) for all z} and zz in Y and all direct 
hyperbolic transformations M in Gg. 


Properties 5 and 6 together assert that hyperbolic transformations of the unit 
disc Y do not alter distances between points, since a hyperbolic transformation 
is either a direct transformation of the unit disc to itself (as in Property 6) or 
the composite of such a function with the conjugation function z +> z (which, e 
by Property 5, preserves distance). 

These additional properties enable us to make some useful observations 
about the form of the distance function. First, the direct hyperbolic transfor- 
mation 
f—Z1 


M:zrH = 
1—Z,z 


, where z; € J, 


maps z; to 0 and z2 to = 


the origin which sends 


If R is the rotation of the unit disc Y about Recall that any rotation of 
—z 2-21 Y about the origin is a 
2122 1—Z12z2 direct hyperbolic 

R o M is a direct hyperbolic transformation which sends z; to 0 and z2 to _ transformation 


22-21 (Subsection 6.2.1). 
1—212z2 


By Property 6 it follows that 


oie 


to the point 


, then overall the composite 


d(zi,22) =d (0 ic a 


= ). for all z1,z2 € F. 
1 — Z12z2 


This shows that the distance d(z1, z2) must be some function of the quantity 
Z~Z1 


1-21.22 
Indeed, does such a function exist? In fact, there is essentially only one ‘well- We verify this assertion in 
behaved’ function which yields these properties, and it turns out to be the Subsection 6.3.5. 
function tanh ~!, 


alone. But what function ensures that d has the Properties 1-6 above? 


Definition The hyperbolic distance d(z;,z2) between the points z; and 
Z2 in the unit disc J is defined by 


d(z1,z2) = tanh”! ( came 


1— 222 
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Of course before we can be sure that this is a reasonable definition of dis- 
tance we need to check that d satisfies all the Properties 1-6. Clearly d satisfies 
Properties 1, 2, 5, and we shall prove the other properties later in this section; 
for the moment we assume that all the properties hold, and we use them to 
explore some of the practical consequences of the definition. 

First observe that the formula for the distance between two given points is 
particularly simple in the case where one of the points lies at the origin. Indeed 
we have 


Definition The hyperbolic distance d(0, z) between the points 0 and z in 
the unit disc J is 
d(0,z) = tanh”! ((z/). 


Since |z| is just the Euclidean distance of z from the origin, this equation 
tells us that we can obtain the hyperbolic distance of a point z from the ori- 
gin by applying the inverse tanh function to the Euclidean distance of z from 
the origin. The graph of the inverse tanh function in the margin reveals two 
important characteristics of hyperbolic distance. 


1. Near the origin the graph is nearly a straight line with slope 1, so for a point 
z near the origin the hyperbolic distance of z from 0 is approximately equal 
to the Euclidean distance of z from 0. (This is analogous to the situation on 
the surface of the Earth; small portions of the Earth’s surface look flat and 
distances between its points are approximately Euclidean.) 

2. As the point z approaches the boundary of the disc Y, the hyperbolic dis- 
tance of z from the origin increases without bound. Indeed, as the Euclidean 
distance |z| tends to 1, the hyperbolic distance tanh7!(|z|) tends to oo. 
From the point of view of someone living in the hyperbolic geometry, the 
boundary points appear to be ‘infinitely far away’ — an observation that is 
consistent with the idea that parallel lines ‘meet’ on the boundary. 


With the help of a calculator we can easily use the formula 
d(0,z) = tanh" ((z\) (1) 


to calculate the hyperbolic distance of a point from the origin. 
For example, the hyperbolic distance of 5 from the origin is given by 


1 
1 — = = — -l os 
d (0. t) tanh (5) tanh~! (0.5) ~ 0.549. 


But we can also use equation (1) to calculate the hyperbolic distance 
between any two points which lie on a diameter of the disc J. 
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Since this formula is 
simpler to remember, we 
shall tend to use it more 
often than the formula for 
d(Z1,Z2)! 


tanh |(Izl) 


I 
f 
I 
I 
I 
I 
I 
I 
1 


At the end of this 
subsection we prove the 
alternative formula 


I+]z 
d(0,2) = } tog, (F444) 


which you can use if your 
calculator does not have a 
tanh function. 
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Example1 Find the hyperbolic distance between the points 0.1 and 0.2. 


Solution Here we use Property 4, which states that distances along a 

(hyperbolic) line are additive. In particular, this implies that 0 0.1 0.2 
d(0.1,0.2) = d(0,0.2) — d(0,0.1), 

sO We can write Throughout Section 6.3 


d(0.1,0.2) = d(0,0.2) — d(0,0.1) we shall work to the full 


accuracy of our calculator, 


— tanh~! ee tanh~! (0.1) but we shall record our 
results only to 3 or 4 

~ 0.203 — 0.100 decimal places. 

~ 0.102. 


Problem 1 Determine the hyperbolic distances d (0, ti) and 
d(0.8, 0.9). 


By rearranging equation (1) we obtain the formula 
|z| = tanh d(0, z). (2) 
This can be used to locate a point, given its hyperbolic distance along a 
radius of the disc Y. For example, the point a that is at a hyperbolic distance 


0.1 from the origin along the positive real axis is given by a = tanh0.1 ~ 
0.0997. 


Problem 2 The following table (plotted in the margin) illustrates how 
points bunch up towards the boundary of Y as their hyperbolic distance — = 
from 0 doubles. 0 0.197 0.380 0.664 


Find the two missing entries in the table. 
d(0,a),a > 0 0.2 0.4 0.8 16 3.2 
a 0.197... 0.380... 0.664... 


If we have to calculate the distance between two points that do not lie on a 
diameter of Y, then we can use the distance formula given in the definition. 


Example 2 Find the hyperbolic distance between the points 5 and zi : 


Solution From the definition of hyperbolic distance we have 


1 
d (4,41) = tanh! | | 2 
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We can use the distances that we have calculated above to demonstrate that 
Pythagoras’ Theorem fails to hold in hyperbolic geometry, at least without 
some rewording. For, if we consider the hyperbolic right-angled triangle with 
vertices at 0, i and 5 then the square of the hypotenuse is 


2 
d(4,4i) ~ 0.6819? ~ 0.4650, 


whereas the sum of the squares of the other two sides is We evaluated d (0. ) 
2 2 before Example 1, and 
d (0, 5) +d (0, si) ~ 0.5493? + 0.3466? ~ 0.4219. a (0, 41) in Problem 1, 


We shall see later on how Pythagoras’ Theorem can be reformulated in Subsection 6.4.3, 
hyperbolic geometry. Theorem 8 
We end this subsection by giving an alternative formula for the hyperbolic 
distance of a point z from the origin. First, observe that 
|z| = tanh d(0, z) 
__ sinhd(0, z) 
~ coshd(0, z) 
: ef (0.2) = e 40,2) 
~ e402) 4 e—d@,z) 
eta) — 
~~ e2d(0,z) 4 47 


We can solve this equation to obtain an expression for d(0, z) in terms of |z|. 
By cross-multiplication, we obtain 


e402) _ 4 = IzI (e20e) + 1), 
which implies that 
e402] — |[z/) = 14+ [el 
which is equivalent to 


o2d(0.z) — 1+ Nel 
1—|zI 


Taking natural logarithms of both sides, we deduce that 


1 
2d(0, z) = tog, ( = ey. 


1—|zI 
and so 
1 1+ ([z| This formula is useful if 
d(0,z) = 5 log, ( i ) : (3) your calculator does not 
— [zl have a tanh function. 


The formula for d(0, z) is sometimes quoted in this form. 


6.3.2 Hyperbolic Midpoints 


Having described how to calculate hyperbolic distances we now introduce the 
idea of the hyperbolic midpoint of a hyperbolic line segment. 
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d(a,m) d(m,b) 


Definition A point m is the hyperbolic midpoint of the hyperbolic line + 
a 


segment joining a and b if m lies on this segment and 7 


d(a,m) = d(m,b) = 5d (a,b). 


For simplicity, we confine our attention to midpoints of line segments that 
lie along a diameter of Y. The method depends on whether the endpoints of 
the segment lie on the same side of the origin O, or on opposite sides of O. 


Example 3 Find the hyperbolic midpoint m of the line segment which joins 
each of the following pairs of points. 


(a) gi and Zi (b) gi and —3i 


Solution First observe that 


d (0, $i) = tanh ( 
d (0, 3) = tanh"! ( 


(a) Here i and xi lie on the same side of 0, so the midpoint must lie at a 
hyperbolic distance 


iis 
ql 


) = tanh~! (0.25) = 0.255..., 


and 
ie 
ql 


) = tanh7!(0.75) = 0.973... 


0.547i is 
further 
from 0 

than 0.5i 


! (4 (0. ii) seg (0, 1‘)) ~ £(0.973 + 0.255) = 0.614 


from 0 on the radius through zi. That is, m ~ (tanh 0.614)i ~ 0.5477. 
(Notice that this is further from 0 than the Euclidean midpoint 0.57.) 


(b) Here -3i and i lie on opposite sides of 0, so the midpoint lies at a 0 
hyperbolic distance 
$ (a (0,32) -d(0, f/)) = 30.973 — 0.255) = 0.359 


from O on the radius through —ii. That is, m~ —(tanh0.359)i ~ 
—0.344i. -ii 


The following strategy generalizes the method used in Example 3. 


Strategy To find the hyperbolic midpoint of a line segment joining two 
points p,q ona diameter of Y, where | p| > |q| : 


1. calculate d(0, p) and d(0,q); 
2. if p and q lie on opposite sides of 0, then calculate 


d = 1(d@,p) —d(0,q)), 
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otherwise calculate 
d= 5(d(0, p) +d(0,q)); 


3. then the hyperbolic midpoint is the point m on the radius through 0 and 
p at a Euclidean distance tanh(d) from 0. 


Problem 3 Find the hyperbolic midpoint m of the line segment which 
joins each of the following pairs of points: 


0.5 and 0.8; —0.2 and 0.8. 


Remark 


To calculate the hyperbolic midpoint of a line segment which joins two arbi- 
trary points p,q in ZY we would use a Mobius transformation M to map p to 
0 and q to M(q). After calculating the hyperbolic midpoint m’ of the segment 
from 0 to M(q), we would then obtain the midpoint of the original segment by 
calculating M~!(m’). 


6.3.3 Hyperbolic Circles 


By analogy with Euclidean circles, we define a hyperbolic circle to be the locus 
of points which are a fixed hyperbolic distance from a fixed point. 


Definition The hyperbolic circle with hyperbolic radius r and hyper- 
bolic centre at c is the set defined by 


eee jar, 2] gO. 


It is natural to ask what a hyperbolic circle looks like to the Euclidean eye. 
In the case of a hyperbolic circle with radius r centred at the origin, it is just 
the set of points given by 


{z:d(0,z)=r, ze}; 
that is, the set 
{z: tanh! (|z]) =r, zEQY}, 


or 


{z:|z| =tanhr, ze QW}. 


This is a Euclidean circle with radius tanhr centred at the origin! Even more 
remarkable is the fact that every hyperbolic circle is a Euclidean circle. 
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Theorem 1 Every hyperbolic circle is a Euclidean circle in Y, and vice 
versa. 


Proof We have already established the result for circles centred at 0, so let 
C be any hyperbolic circle with hyperbolic centre m ¢ 0. Let the diameter of 
QZ through m meet C at the points a and b, and let K be the Euclidean circle 
which has ab as a diameter and (Euclidean) centre p. 


Hyperbolic 4) — 22m 
circle Ccentred = 
atm Cc 


Euclidean 
circle K 
centred at the 
midpoint p 
of ab 


We will show that C is a Euclidean circle by showing that it coincides with 
K. To do this let M be the hyperbolic transformation defined by 


z= 


M(z) = 7 


zEY, 


and let a’ and b’ be the images of a and b under M. 

Since M preserves hyperbolic distances, and maps m to O, it must map C 
to a hyperbolic circle C’ with centre O. But we already know that such a circle 
is also a Euclidean circle. Moreover, since the line segment ab passes through 
m, its image a’b’ passes through O and is therefore a diameter of C’. 

Also, since M is angle-preserving, and since ab meets K at right angles, it 
follows that M maps K to a circle K’ which meets a’b’ at right angles at the 
points a’ and b’. This implies that a’b’ is also a diameter of K’. It follows that 
C’ coincides with K’, and hence that C coincides with K. 

Conversely, let K be any Euclidean circle in with Euclidean centre p £0, 
and let the diameter of Z through p meet K at the points a and b. If C is 
the hyperbolic circle through a and b with hyperbolic centre the hyperbolic 
midpoint m of ab, then we can use the same argument as before to deduce that 
K coincides with C, and so we conclude that K is a hyperbolic circle. a 


The above proof shows that the Euclidean and hyperbolic centres of a circle 
lie on the same line through O. This observation enables us to write down the 
following strategy for finding the Euclidean centre and radius of a hyperbolic 
circle. 
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For the steps in the 
argument before Theorem 
1 can be reversed. 


Theorem 2, Subsection 
6.1.1 
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Strategy To find the Euclidean centre and radius of a hyperbolic circle C 
with hyperbolic centre m: 


1. find the points a,b where Om meets C; 
2. the Euclidean centre of C is the Euclidean midpoint of ab; 
3. the Euclidean radius of C is sla — bj. 


Example 4 Find the Euclidean centre and radius of the hyperbolic circle 


c= fesa(el) = 4} 


C 
Solution Here the hyperbolic centre m is the point 7 and so O 
a b 
d(0,m) = tanh7! (3) ~ 0.549. 
Since the hyperbolic radius of C is equal to 5 it follows that Om meets C 9g 


at the points a, b, where 
d(0,a) ~ 0.549 — 0.5 = 0.049, 
d(0,b) ~ 0.549 + 0.5 = 1.049. 
So 
a = tanh0.049 ~ 0.049, 
b = tanh 1.049 ~ 0.782. 
Since a and b both lie on the same side of 0, we have 


Euclidean centre = 5(a +b) ~ 0.415, 


Euclidean radius = sla — b| ~ 0.366. 


Problem 4 Determine the Euclidean centre and radius of the hyper- 
bolic circle 
c= [esa(e-B)=H) 


A similar strategy can be used to find the hyperbolic centre and radius of a 
Euclidean circle. 


Strategy To find the hyperbolic centre and radius of a Euclidean circle K 
with Euclidean centre p: 


1. find the points a,b where Op meets K; 
2. the hyperbolic centre of K is the hyperbolic midpoint of ab; 
3. the hyperbolic radius of K is 5d (a,b). 
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Example 5 Find the hyperbolic centre and radius of the Euclidean circle 
= 1| 
=7I- 


Solution Here the Euclidean centre p is the point —5i, and the Euclidean 
radius is i so Op meets K at the points a = —4i and b = —2i. Thus 


K={z:|e+}i 


d(0,a) = tanh7! (|-4i 


) = tanh~! (0.25) ~ 0.255...: 


d(O,a) = tanh7! (|-3 


) = tanh7!(0.75) ~ 0.973... —-ti 


Since a and b both lie on the same side of O, the hyperbolic centre m of K is K z 
given by 
d(0,m) = 5(d(0,a) + d(0,b)) ~ 0.614, 


so that 


hyperbolic centre m ~ —i - tanh0.614 ~ —0.547i; 


hyperbolic radius = 5|d(0, a) — d(0,b)| ~ 0.359. 


Problem 5 Determine the hyperbolic centre and radius of the 
Euclidean circle 


K 


ll 
—— 
“XN 
NX 
| 
Ae 
ll 
NI 
— 


We now use the fact that hyperbolic circles are also Euclidean circles to 


prove the Triangle Inequality property of hyperbolic distance. Property 3 of a distance 
function, Subsection 


: Deke 
Theorem 2 For all z), z2, z3 in 2: a 


OME 23) a Ee) = CE a 


Proof First let M be a hyperbolic transformation that maps z; to 0, and let 
M(z2) = b and M(z3) = c. Then Ob is a straight line. 
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Taking O as centre we draw a hyperbolic circle C2 through c to meet Ob 
at the point g; then d(O,q) = d(O,c). Similarly, taking b as centre we draw 
a hyperbolic circle C; through c to meet Ob at the point p, where d(b, p) = 
d(b,c). 


hyperbolic circle 
with hyperbolic 
centre O 


hyperbolic circle 
with hyperbolic 
centre b 


Since both circles are also Euclidean circles with centres on Ob, and since 
the circles intersect, it follows that 


d(0,c) + d(c, b) = d(0,q) + d(p, b) 
> d(0,q) + d(q,b) 
= d(0,b). 


Since M preserves hyperbolic distances, we deduce that 


d(z1,23) + d(z3, 22) = d(Z1, 22), 


as required. | 


We can deduce from the Triangle Inequality (Property 3) that in hyperbolic 
geometry the curve of shortest length or ‘geodesic’ between two points of F is 
the segment of the d-line that joins them. Rather loosely, we can express this 
fact as follows: “Distances are measured along d-lines in hyperbolic geometry.’ 


6.3.4 


In Euclidean geometry the image A’ of a point A under reflection in a line @ is 
an equal distance from the line and on the opposite side of the line, so that @ 
is the perpendicular bisector of the line segment AA’. We now prove that the 
same result also holds in hyperbolic geometry, where the distances and lines 
are hyperbolic ones. 

To state the result, we need the concept of hyperbolic line segment, or 
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d-line segment; this is just that part of the d-line through two points which 
lies between those points. 
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We have drawn this figure 
in the case that p and q lie 
between O and b. The 
argument also holds if g 
lies beyond b or p 

beyond O. 


Recall that hyperbolic 
circles are Euclidean 
circles, by Theorem 1. 


Property 6, Subsection 
6.3.1 


We omit a formal 
definition of ‘length of a 
curve’, and so a proof of 
this claim. 
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Theorem 3 Let A and A’ be points in the unit disc J that are images 
of each other under reflection in a d-line £. Then @ is the hyperbolic 
perpendicular bisector of the hyperbolic line segment AA’. 


Proof Let the d-line ¢’ through A and A’ meet ¢ at P. 

Now A and A’ map to each other under hyperbolic reflection in ¢, and P 
remains invariant. But hyperbolic reflection maps d-lines to d-lines, and there 
is exactly one d-line through A and A’. Thus ¢’ must map onto itself under the 
hyperbolic reflection in £, and the hyperbolic line segment PA must map onto 
the hyperbolic line segment PA’. 

But hyperbolic reflection preserves angles and lengths. Since lengths are 
preserved, it follows that PA and PA’ must be of equal hyperbolic length. 
Also, since the angles that PA and PA’ make with the same part of @ at P map 
onto each other, they must be equal too; since they must add up to a total angle 
zt at P (because APA’ is a hyperbolic line segment through P), it follows that 
PA and PA’ both meet ¢ at right angles. a 


We already know one case where we can find the hyperbolic perpendicular 
bisector of a hyperbolic line segment AA’. This is the case where one of the 
vertices, say A’, coincides with O, the centre of the disc Z. In that case, the 
reflection that sends A (represented by the complex number ) to O is given 
by M(z) = 4251, for some a = a + ib, with |a| > 1. 


The equation of the d-line obtained from @ in which the reflection takes 
place is, as we saw earlier, 


x? 4 y? — 2ax —2by +1=0. 


Since M(B) = 0, it follows from the formula for M that wf — 1 = 0. 

More generally, suppose we want to find a hyperbolic reflection that 
exchanges the points p and qg in Y. Again, the hyperbolic reflection that does 
this must be of the form 


az—1 
M(z) =-— —; 
Z-—a 
The condition M(p) = g implies 
_ apa 
= Se 
which is equivalent to 
1+ pg =aq+ap. (4) 
The condition M(q) = p implies 
aq-1 
| re 
q-a 


which is equivalent to 
1+ Gp =ap tag. (5) 


By Corollary 1, 
Subsection 6.1.1 


Subsection 6.2.1, 
Lemma | 


ea=a+ ib 
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Equations (4) and (5) can be solved for w provided that pp — gq # 0. Indeed, 
if we subtract g times equation (5) from p times equation (4), and then divide 
by pp — qq, we obtain 
gh TE PI =a) 
PP— 44 

If pp — qq = O, then |p| = |q|, in which case the sought-for reflection 
is simply Euclidean reflection in the diameter of Z which bisects the angle 
ZpOgq. 

We summarize this discussion in the following lemma. 


Lemmal_ Reflection Lemma 
Let p and q be points in the unit disc J. If |p| 4 |q|, then the hyperbolic 
reflection that maps p and q onto each other is given by 
=] a — 
Mz) = ats = Whenreon— v ge a 
zZ—a PP—44q 


The d-line in which this reflection takes place has equation 


oy = ax ye = 0) where a =a + ib. 


Example 6 Determine the equation of the hyperbolic perpendicular bisector 
of [0.2, 0.9], the line segment from 0.2 to 0.9. 


Solution Using the Reflection Lemma, with p = 0.9 and q = 0.2, we 
find that 
_ 0.7 +0.9+02+07 
~ 0.81 — 0.04 
So the equation of the d-line which is the (hyperbolic) perpendicular bisector 
of [0.2, 0.9] is 


~ 1.0727. 


x? 4 y? —2ax+1=0, wherea ~ 1.0727. 


Problem 6 


(a) Determine the equation of the hyperbolic perpendicular bisector ¢ of 
the line segment [0.5, 0.8]. 

(b) Deduce the equations of the hyperbolic perpendicular bisectors of 
the line segments [—0.8, —0.5] and [0.57, 0.87]. 


Problem 7 


(a) Let A be a point in Y, € ad-line through A, and 0 < 6 < z. Prove 
that there are exactly two d-lines through A that make an angle 0 
with €. 
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Notice that in the special 
case when q = 0, this 
formula gives a = 1/p, as 
it should. 


By Lemma 1, 
Subsection 6.1.1 


We often write [z1,z2] to 
mean the Euclidean line 
segment in C with 
endpoints z1, Z2. 
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(b) Prove that, if AABC is a d-triangle, then there is a unique d-line 
£ that bisects ZBAC, and that reflection in € maps the d-lines 
containing BA and CA onto each other. 


6.3.5 Proofs 


In Subsection 6.3.1 we asserted that the hyperbolic distance between two 
points is invariant under hyperbolic transformations. We now supply a proof 
of this fact. 


Theorem 4 The formula for hyperbolic distance 


d(z1,29) = tanh! ( pam’ 


= , where z1,2Z2 € J, 
1 — 2122 


satisfies Property 6; that is, See Subsection 6.3.1. 
d(z1,22) = d(M(z1), M(z2)) 


for all z; and zz in Y and all direct hyperbolic transformations M in Gg. 


Proof First, we define the expression R(z1, Z2) to be 


22 — Z1 


R(Z1, 22) = , where z1,22 € F. (6) 


1 — 2122 


It follows from the definition of d(z1, z2) above that We omit the details of 
this, for brevity. 


d(z1,22) = tanh” '(R(Z1,22)); 
hence d possesses Property 6 if we can prove that 
R(Z1, 22) = R(M(z1), M(Z2)) (7) 


for all z1,z2 € Dandall M € Gg. 
So, let z1,z2 € Mand M € Gg. Then the direct hyperbolic transformation 
M, in Gg given by 


Mi :zb — ZED, 
1—Z1z 
maps z 1 to 0, and z2 to ace 
1 — 2122 
Also, the direct hyperbolic transformation M2 in Gg given by 
—M 
eee Te Gi) J, 
1— M(z1)z 
where MeGzg so that |M(z1)|<1, maps M(z1) to 0, and M(z2) to 


M(z2)—M(z1) 
1—M(z1)M(z2)° 
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Now, the composite mapping Mz70 MoM, ' is also a transformation in G Q- 
It maps J to F, 0 to 0, and 


M(z2) — M(21) 

1 — M(z)M(z2) 
But any direct transformation in Gg that maps 0 to 0 is simply a (Euclidean) 

rotation; so it follows from equation (8) that 

_ | M@2) — M1) 

= MGM) 
It follows from the definition of R that we can rewrite equation (9) in the 

desired form 


22 — Z1 


(8) 


1 —Z12z2 


£2 = £1 


(9) 


1 — 2122 


R(z1, 22) = R(M(z1), M(z2)). a 


Next, we stated earlier that the function d that we used to define the hyperbolic 
distance function in Y was ‘essentially’ the only ‘well-behaved’ function with 
Properties 1-6 of Subsection 6.3.1. We now explain why this is so. 


Theorem 5 Let d(z1,z2) be any ‘well-behaved’ function defined for all 
Z1,z2 € Y that satisfies Properties 1-6 of Subsection 6.3.1. Then 


Se oll 


d(z1, 22) = K tanh! ( (10) 


— " for some K > 0. 
1—Zz2 


Proof Let z1,z2 € Y. Since hyperbolic distances are invariant under Mobius 
transformations, d(z1, z2) is unchanged if we map z, to the origin by the direct 
hyperbolic transformation (a Moébius transformation) 


4 £1 


Zh 7 
Next, hyperbolic distances are invariant under rotations of Y, since such rota- 
tions are hyperbolic transformations. So, we may also assume that z2 lies on 
the positive real axis. 

Hence, in order to prove the formula (10) it is sufficient to prove that 


d(0,z) = K tanh7!(z), for some K > 0 and all z € (0, 1). dl) 


For simplicity, we will now simply write d(z) in place of d(0, z) whenever the 
context means that no confusion will arise. 

Let 0 < a,c < 1. Then the direct hyperbolic transformation (a Mébius 
transformation) 


Z+a 
M(z) = eee, zEG 
maps 0 to a, and c onto b = ae Since M maps [0, 1) one-one to [a, 1), we 


must have a < b < 1. 
Now, hyperbolic distances are additive along a line, so that 


d(b) = d(a) + d(a,b). (12) 
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Note that M . maps 0 to 
22 21 


Z1, and to z2. 


1 — z,z2 
See Case | of Theorem 2, 


Subsection 6.2.1. 


We defined R in 
equation (6). 


By Property 6 


By Property 4 
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But, since hyperbolic distances are invariant under hyperbolic transformations, 
we have 


d(a,b) = d(0,c) 
=d(c), 
so that it follows from equation (12) that 
c+a 
d =d d(c). 
(*) (a) + d(c) 


Now we will assume that the function d is differentiable on [0, 1). If we then 
differentiate both sides of equation (13) with respect to the variable c, we get 


d! cta (1+ac)-1—(c+a)-a 
1+ ac (1 + ac)? 

and we may rewrite this equation in the form 

d' ct+a - (1 + ac)? 

l+ac 1—a? 

If we substitute 0 for c into this equation, we get 
d'a(0) 
1—a? 


(13) 


=0+d'(o); 


-d'(c). 


d'(a) = 


K 
= ——., for some real K. 
1-—a? 


Since d must be an increasing function on [0, 1), we must have K > 0. But 
we cannot have K = 0, as it would then follow from the above equation that 
d'(a) = 0 for alla € [0, 1) - so that d(0, a) takes the same value (which would 
have to be 0) for all a € [0, 1). Hence 


d'(a) = for some K > 0, and alla € (0, 1). 


K 
1—a?’ 
If we then integrate both sides of this formula from 0 to z, we get 


d(z) = K tanh7!(z), for some K > 0, and all z € [0, 1). Oo 


Remarks 
1. For any (complex) point z near the origin, equation (10) gives 
d(0,z) = K tanh“! (|z|) 
~ K\z\, 

so it is natural to make the choice K = 1 for the definition of distance in 

hyperbolic geometry. Then d(0,z) = tanh™!(|z|), for all z € Z. 
2. When d(0,z) = tanh7!(\z|), z € Z, we can reformulate the equation 

1 


d'(z) = ez in the form 


dz 
as = 22 
1 = |z| 
relating infinitesimal hyperbolic distances ds to infinitesimal Euclidean 
distances |dz| in ZY near any point z € J. 
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By Property 6 


In this equation, a and c 
are independent variables. 


This is where we use the 
hypothesis that the 
distance function is 
‘well-behaved’. 


Here d’p (0) is the right 
derivative of d at 0. 


This follows from 
Property 1. 


In advanced texts, the 
formula for hyperbolic 
distance is often used in 
this infinitesimal form. 
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6.4 Geometrical Theorems 


6.4.1‘ Triangles 


A triangle in hyperbolic geometry, or a d-triangle, consists of three points 
in the unit disc Y that do not lie on a single d-line, together with the seg- 
ments of the three d-lines joining them. One or two of the sides may be 
segments of diameters of Z; but, in general, the sides are parts of Euclidean 


circles. 


Notice that if AABC is a d-triangle, then the extended sides can meet only 
at A, B and C. For if the d-lines through A and B and through B and C meet 
at a point P distinct from B, then the two d-lines meet at B and P. But we saw 
earlier that there is a unique d-line through any two points of Y; so it would 
follow that AB and BC are part of a single d-line, a possibility that we have 
already excluded. So the sides of a d-triangle cannot ‘overlap’. 

One of the basic results in Euclidean geometry is that the sum of the angles 
of a (Euclidean) triangle is 2. This is not true for d-triangles in hyperbolic 
geometry! 


Theorem 1 The sum of the angles of a d-triangle is less than z. 


Proof By the Origin Lemma, we can map the d-triangle AABC onto a 
d-triangle AOB’C' by any hyperbolic transformation that sends A to the origin 
O. Since hyperbolic transformations preserve angles, the sums of the angles 
of the two d-triangles are the same. 

Then OB’ and OC’ are parts of Euclidean lines, and B’C’ is part of a 
Euclidean circle that ‘bends towards’ the origin. Thus the angles at B’ and 
C’ of the d-triangle AOB’C’ are less than the corresponding angles of the 
Euclidean triangle AOB’C’, and the angles of both triangles at O are the 
same. 

Since the sum of the angles of the Euclidean triangle AOB’C’ is zr, it follows 
that the sum of the angles of the d-triangle AOB’C’ is (strictly) less than zr. 
The result then follows. oO 


Problem 1 Prove that each external angle of a d-triangle is greater 
than the sum of the opposite two internal angles. (See the diagram on the 
left below.) 


Subsection 6.1.2, 
Theorem 4 


Subsection 6.1.2, 
Lemma 2 
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Problem 2 Let AABC and AABC' be two hyperbolic triangles such 
that A,C and C’ lie on a d-line in this order. Let the angles at A, B,C 
in the first triangle and at A, B,C’ in the second triangle be a, 8, y and 
a, B+ B’, y’, respectively. Show thata+B+y >a+B+f'+y’. (See 
the diagram on the right, above.) 


It will be seen from Problem 2 that the larger triangle, AABC’, has the 
smaller angle sum. This is true in general, not only when one triangle fits 
neatly inside another. In fact, it can be shown that the area of a hyperbolic 
triangle with angles a, 6 and y is proportional to 7 — (a + 6 + y), and we 
shall prove this in Subsection 6.5.1. You may for the moment use the following 
result as a useful memory aid: 


small triangles have angle sums close to (but less than) zr, and triangles with 
large areas have angle sums close to zero. 


In Euclidean geometry, we can also prove that the sum of the angles of a 
(Euclidean) quadrilateral equals 277. Now, a quadrilateral ABCD in hyper- 
bolic geometry, or a d-quadrilateral, consists of four points A, B,C, D in F 
(no three of which lie on a single d-line), together with the segments AB, BC, 
CD and DA of the four d-lines joining them. We require also that no two of 
these segments meet except at one of the points A, B, C or D. 


How large can the sum of the angles of a d-quadrilateral be? 


Theorem 2 The sum of the angles of a d-quadrilateral is less than 277. 


We shall assume this fact 
Proof Any d-quadrilateral can be divided into two (non-overlapping) without proof; see the 
d-triangles by one or other of the d-lines joining alternate vertices. figures below. 


Geometrical Theorems 


Cc 


The angles of each d-triangle sum to less than 2. The angles of the 
d-quadrilateral belong to one or other d-triangle, or partly to one and partly 
to the other. It follows that the angles of the d-quadrilateral sum to less 
than 277. Oo 


In hyperbolic geometry there are many theorems about d-triangles that are 
analogues of the corresponding theorems about Euclidean triangles, such as 
the following. 


Theorem 3 Let AABC be a d-triangle in which ZABC = ZACB. Then 
the sides AB and AC are of equal length. 


Proof Let D be the midpoint of the d-line segment BC. By applying the Ori- 
gin Lemma, if necessary, we may assume that D coincides with O, the centre 
of the disc Y. (Although this is not strictly necessary for the proof, it simplifies 
the picture.) Then BC is part of a diameter of J. 

Let the d-line @ be the perpendicular bisector of BC; it is the diameter of 
QZ perpendicular to BC. Reflect (in both the Euclidean and hyperbolic senses) 
the triangle AABC in the d-line ¢. Because reflections preserve length, and 
DB= DC, it follows that B and C change places. Suppose that A moves to 
some point A’. Since reflection preserves angles, it follows that ZA’BC = 
ZACB. Also, recall that we are given that ZACB = ZABC, so ZA'BC = 
ZABC. But this can happen only if A’ lies on the d-line through A and B. 
Similarly, 7A’CB = ZABC = ZACB, so A’ must also lie on the d-line through 
A and C. This means that A and A’ must coincide. Hence the d-line segment 
BA reflects to the d-line segment CA, and vice versa; so these d-line segments 
have the same length. | 


Problem 3 Let AABC be a d-triangle in which the sides AB and AC 
have equal hyperbolic length. Prove that ZABC = ZACB. 
Hint: Consider reflection in the d-line that bisects angle Z BAC. 


The notion of mapping one figure onto another exactly by a transformation 
in the geometry (for example, the reflection in the d-line ¢ in the proof of 
Theorem 3) is one that we met previously: namely, congruence. 


Definition Two figures in the unit disc Y are d-congruent if there is a 
hyperbolic transformation that maps one onto the other. 
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We shall use this fact later. 


This is the hyperbolic 
analogue of the Euclidean 
result in Example 1, 
Subsection 2.1.1. 
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For example, if A, B, C and D are four points in Y such that the hyperbolic 
distances d(A, B) and d(C, D) are equal, then the d-line segment AB is con- 
gruent to the d-line segment CD. For, as we saw earlier, there is a hyperbolic 
transformation ¢ that maps A to C and the d-line through A and B onto the 
d-line through C and D, in such a way that D and #(B) lie on the same side of 
C along t(£). Then, since B and D are the same hyperbolic distances from A 
and C, respectively, it follows that t must map B onto D. Thus AB and CD are 
congruent. 

Now, since hyperbolic transformations preserve angles, it follows that if 
two d-triangles do not have corresponding angles equal (that is, they are not 
similar), then they certainly cannot be d-congruent. 

However, the following result is still very surprising, because the analogous 
result is false in Euclidean geometry. 


Theorem 4 Similar d-triangles are d-congruent. 


Proof We have to prove that if the d-triangles AABC and APQR have the 
angles at A, B and C and the angles at P. Q and R equal, respectively, then the 
two d-triangles are d-congruent. 

We may apply a hyperbolic transformation to map A to the origin O; this 
does not change the angles of the triangle. To avoid complicated notation, we 
shall still denote this image d-triangle by AABC. 

We can also apply a hyperbolic transformation to map P to the origin and the 
radius on which Q lies to the radius on which B lies. By reflecting in the d-line 
through O and B, if necessary, we can deduce from the fact that the angles at 
A and P are equal that the image of R lies on the same radius as (the image of) 
C. Again, we shall denote the image d-triangle still by the notation APQR, for 
simplicity. 

The following figure shows the result of the transformations described 
above. 


R lies 
on this 
radius 


To prove the result, we have to show (in terms of the points obtained by the 
above preliminary mappings) that B = Q and C= R. We proceed by considering 
the various possible situations that would arise if the result were false. 


(a) The d-line segment BC might lie between the origin and the d-line segment 
OR. (The argument is similar if QR lies between O and BC.) 
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Subsection 6.1.2, 
Theorem 5, and the 
Remark following that 
theorem. 


Recall that two Euclidean 
figures are similar if one 
is a scale copy of the 
other; in particular, two 
Euclidean triangles are 
similar if they have 
corresponding angles 
equal. 


Geometrical Theorems 


In this case, 
ZRCB=x—ZOCB and ZCBQ=2 — ZOBC, 
so the angle sum of the d-quadrilateral CBQR is 
(x — ZOCB) + (x — ZOBC) + ZOQR+ ZORQ = 22, 
which is impossible, by Theorem 2. 
(b) Band Q may coincide, but not C and R. (The argument is similar if R lies 
between O and C, or if C and R coincide, but not B and Q.) 

In this case, if C lies between O and R, the external angle at C of the 
d-triangle ABCR is less than or equal to the sum of the opposite two 
internal angles, which is impossible by the result of Problem 1. 

The d-line segments BC and QR may cross. 

Let the point of intersection of BC and QR be X. In this case, the exter- 
nal angle at B of the d-triangle AXBQ is less than or equal to the sum 
of the opposite two internal angles, which is impossible by the result of 
Problem 1. 


It follows that the only possibility is that B = Q andC = R,asrequired. 


(c 


wm 


In the following diagram, all the d-triangles have the same angles at 
their vertices, and so are similar. It follows from Theorem 4 that they are 
d-congruent to each other. 
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This looks very unlikely, since the triangles ‘seem’ to be getting smaller as 
they move away from the origin towards @; however, it is a consequence of 
the way in which we defined both hyperbolic transformations and hyperbolic 
distance that all the triangles have sides of the same (hyperbolic) lengths as 
well as angles of the same sizes. This results from the fact that equal Euclidean 
distances on a ruler correspond to increasing hyperbolic distances as the ruler 
is moved outwards. 
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Asymptotic Triangles 


The vertices of d-triangles lie in the unit disc ZY. However, it is often useful to ‘eitiaeaieate . 
talk about figures in Y with three sides that are d-lines but where one or more ibaa ti 
of the Euclidean circles or lines of which they are part meet on @ rather than 

in Y. In this sense, we say that they are triangles ‘with one or more vertices on 

@’; such triangles are called asymptotic triangles. 

If an asymptotic triangle has one vertex on @ is said to be simply asymp- 
totic, if it has two vertices on @ doubly asymptotic, and three vertices on @ 
trebly asymptotic. e 

There is no hyperbolic transformation that maps points of @ to points of J, 
or vice-versa; so asymptotic triangles are essentially different from ‘ordi- 
nary’ d-triangles. However, it turns out that many of the results that hold for 
d-triangles hold also for asymptotic triangles, and that their proofs are similar. 


simply asymptotic 


Theorem 5 The angle sum of an asymptotic triangle is less than z. The We omit the proof of this 


angle sum of a trebly asymptotic triangle is zero. result. The proof of the 
first assertion is similar to 
that of Theorem 1. 


Problem 4 Prove that two doubly asymptotic triangles are 
d-congruent if and only if they have the same angle at their vertex in J. 


In hyperbolic geometry you should be careful not to assume that results are 
valid simply because they hold in Euclidean geometry. Sometimes asymptotic 


triangles are useful in constructing counter-examples. We shall do exactly this at 
the end of 
6.4.2 Perpendicular Lines Subsection 6.4.2. 


In Euclidean geometry, two given lines @ and ¢’ have a common perpendicular 
if and only if they are parallel to each other. In hyperbolic geometry, the sit- 
uation is somewhat different, as you will see. First, we must define the term 
common perpendicular in hyperbolic geometry. 


Definition Let ¢ and ¢’ be two d-lines, and suppose that there exist points 
A on £and A’ on ¢’ such that the d-line segment AA’ meets £ and ¢’ at right 
angles. Then AA’ is a common perpendicular to £ and @’. 


Theorem 6 Common Perpendiculars Theorem 
Two d-lines have a common perpendicular if and only they are 
ultra-parallel. This common perpendicular is unique. e 


Proof First we show that if two d-lines € and ¢’ have a common perpendic- 
ular, then they are ultra-parallel. Let the common perpendicular be the d-line 
segment AA’, where A is on @ and A’ is on ¢’. By the Origin Lemma, we can 
find a transformation r € Gg which maps the point A to the origin, O. Then 
the d-line r(£) is a diameter of the disc Y, as shown on the left below. The 
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d-line r(¢’) is part of a Euclidean circle whose centre R lies somewhere out- 
side the disc Z on the continuation of the diameter joining O to r(A’). The 
radius of this Euclidean circle is less than RO, and RO is perpendicular to r(¢), 
so this circle cannot meet r(£), and hence r(€) and r(¢’) are ultra-parallel. It 
follows that the d-lines @ and ¢’ are ultra-parallel. 


Secondly, we show that if two d-lines ¢ and €’ are ultra-parallel, then they 
have a common perpendicular. As shown in the figure on the right above, let 
the boundary points of € be A and B, and the boundary points of ¢’ be A’ and 
B’. Consider the Euclidean lines AB and A'B’. 

If the Euclidean lines AB and A’ B’ are not parallel, then they meet at a point 
R outside Y, as shown on the right above. Let RT be a tangent from R to the 
boundary circle @. Consider the circle C with centre R and radius RT. This 
circle meets @ at right angles, so the part of it in J is a d-line, €” say. 

The Euclidean triangles ARTB and ARAT are similar, since ZTRB = ZART 
(being the same angle) and 7RTB = ZRAT (since the exterior angle equals the 
interior opposite angle for a tangent and chord of a circle). Hence, in particular, 
RB/RT = RT/RA so that RA - RB = RT”. In other words, A and B are inverse 
points with respect to the circle with centre R and radius RT. 

Inversion in C exchanges A and B, and exchanges A’ and B’. So it maps the 
d-line £ to itself, and the d-line @’ to itself. It follows that @” intersects £ and ¢’ 
at right angles, and so a common perpendicular to the d-lines € and £’ exists. 


390 6: Hyperbolic Geometry: the Poincaré Model 


If the Euclidean lines AB and A’ B’ are parallel, then there is a diameter ¢” of 
Q that is the perpendicular bisector of AB and A’ B’. It therefore passes through 
the centres R and R’ of the Euclidean circles of which @ and ¢’ are parts, and 
so it is perpendicular to both ¢ and ¢’. Since €” is a diameter of J it is a d-line, 
and the common perpendicular we seek exists. 

Finally, the common perpendicular to two ultra-parallel lines is unique. For, 
if there were two, and they were disjoint (as shown on the top illustration 
alongside), they and the two ultra-parallel lines would form four sides of a 
hyperbolic quadrilateral all of whose angles were right angles. But then the 
angle sum of such a quadrilateral would be 277, which is impossible. Similarly, 
if there were two common perpendiculars that intersected each other (as shown 
in the lower illustration alongside), we should have a d-triangle whose angle 


sum was > 7, which is impossible. 
It follows that the common perpendicular to two ultra-parallel lines is 
unique. | 


In Euclidean geometry, the altitudes of a triangle play an interesting role; 
this is also the case in hyperbolic geometry. 

Before we can talk about altitudes, we need to prove that through any point This fact is needed in 
P not on a given d-line ¢ there is necessarily another (unique) d-line p such order to prove that 


that p meets £ at right angles. altitudes of a given 
d-triangle exist. 


By a preliminary hyperbolic transformation, if necessary, we may assume 
that P is the origin. Let £ have boundary points A and B, and let R be the 
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centre of the Euclidean circle of which @ is a part. Let the (Euclidean) line OR 
meet ¢ at X; OR and £ intersect at right angles at X, so that the d-line segment 
OX is a perpendicular from O to ¢. The diameter which includes OX meets 
at right angles, as required. It is unique because no other line through O can 
meet ¢ at right angles. 

Thus it is possible to ‘drop’ a perpendicular from a point to a d-line, and so 
it makes sense to talk about the altitude through a vertex of a d-triangle. 


Definition Let ¢ be a d-line which passes through one vertex A of a 
d-triangle AABC and which is perpendicular to the side BC at the point D. 
The d-line segment AD is an altitude of the triangle AABC. 


Theorem 7 Altitude Theorem 

Let the sides AB and AC of a d-triangle AABC be of equal hyperbolic 
length, and let the angle at A be 0. Then the hyperbolic length of the altitude 
through A of the triangle is less than some number that depends only on 0. 


Proof Map the d-triangle AABC onto a d-triangle AOB’C’ by any hyperbolic 
transformation that sends A to the origin O. Since hyperbolic transformations 
preserve (hyperbolic) lengths and angles, OB’ and OC’ are of equal length and 
ZB'OC' =8. 

Let the diameter m of J that bisects the angle 7B’OC’ meet the d-line 
segment B/C’ at D’. It is clear, by symmetry, that the d-line segment OD’ 
must be perpendicular to the d-line segment B/C’, so that OD' (which is part 
of a Euclidean line) is the altitude of the d-triangle AOB’C’ through O. Since 
d-triangles have a unique altitude through each vertex, the altitude AD of the 
d-triangle AABC must map onto the altitude OD’, and so they are of equal 
hyperbolic length. 

Then the length of OD’ is less than the length of OD’, where D” lies on 
m and the d-line joining the boundary points B” and C” of the lines OB’ and 
OC’ (as shown). It follows that the hyperbolic length of the original altitude 
AD is less than the hyperbolic length of OD”, which clearly depends only 
on @. Oo 


Problem 5 Prove that the d-triangles AOB’D’ and AOC’D’ in the 
proof of Theorem 7 are d-congruent. 


Problem 6 Determine a (numerical) upper bound for the hyperbolic 
length of the altitude AD of an isosceles d-triangle AABC in which AB 
and AC are of equal hyperbolic length and the angle at A is a right angle. 
Hint: This situation is a special case of that in Theorem 7: 6 = 1/2, 
so B” and C” in the proof of that theorem are | and i, respectively. 


In Euclidean geometry the altitudes of a triangle are concurrent; but this is 
not true in general in hyperbolic geometry. For example, let A, B and C be 
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the points —1,0 and ee Then the altitudes through A and C of the d-triangle 
4 : . : Iti 1-i : 

AABC are the d-lines with endpoints —1, —i and Ja? JB? respectively. These 


d-lines do not meet anywhere in Y or on @. So certainly the altitudes of a 
doubly asymptotic triangle are not concurrent. 

Finally, if we choose a (non-asymptotic) d-triangle with one vertex at O 
and the other two vertices in Z but very close to A and C, it is clear that the 
altitudes of this d-triangle cannot be concurrent. 


6.4.3 Right-Angled d-Triangles 


In Euclidean geometry, Pythagoras’ Theorem plays a central role in many 
calculations. As we have seen, however, the theorem does not hold in hyper- 
bolic geometry if we simply replace Euclidean lines by d-lines and Euclidean 
distances by hyperbolic distances in its statement. 

However it seems reasonable to imagine that hyperbolic functions might 
play some role in a version of Pythagoras’ Theorem in hyperbolic geometry, 
and indeed this is the case. 


Theorem 8 Pythagoras’ Theorem 
Let AABC be a d-triangle in which the angle at C is a right angle. If a,b 
and c are the hyperbolic lengths of BC, CA and AB, then 


cosh 2c = cosh 2a x cosh 2b. 


Problem 7 Use Pythagoras’ Theorem to determine the hyperbolic 
lengths of the sides of the d-triangle with vertices 0, (1 + i)/2/2 and 
(—1+i)/5/2, which has a right angle at O. 


Problem 8 Use Pythagoras’ Theorem to determine the hyperbolic 
lengths of the sides of the d-triangles with vertices at the following 
points: 


(a) 0, ; and 3i; (b) 0,r and ir (where 0 < r < 1). 


We can use these ideas to provide another solution to Problem 6 above. 


Example 1 Prove that, if AABC is an isosceles d-triangle in which AB and 
AC are of equal hyperbolic lengths and the angle at A is a right angle, then 


the hyperbolic length of the altitude AD is less than i cosh7! (v2) ~ 0.4407 
(which is 3 log, (1+ ¥2)). 


Solution The given triangle is d-congruent to the d-triangle AOB’C’ with 
vertices at 0,r and ir, for some r with 0 < r < 1. Let OD’ be an altitude of 
this triangle. Let the hyperbolic lengths of the sides OB’ and OC’ be denoted 
by b, of side B’C’ by a, and of the altitude OD’ by d. 
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Paragraph following 
Example 2 of Subsection 
6.3.1. 


We prove Theorem 8 later 
in this subsection. 


In Problem 6 we found 
that this upper bound 


I cosh~! (v2) was equal 


to tanh~! 2s i); in 
fact the two numbers are 
equal. Such unexpected 
identities involving 
hyperbolic functions 
occur surprisingly often! 
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It follows from Problem 5 (or by symmetry) that D’ is the Euclidean mid- 
point and the hyperbolic midpoint of the d-line segment B’C’, so that the d-line 
segments B’D’ and C’ D’ both have hyperbolic length 5a. 

By applying Pythagoras’ Theorem to the d-triangle AOB’ D’, we find that 


cosh 2b = (cosh2d) x (cosh2 (52)) 


= (cosh 2d) x cosha. (1) 


Next, by applying Pythagoras’ Theorem to the d-triangle AOB’C’, we find 
that 


cosh 2a = cosh? 2b. (2) 
Now, we may rewrite equation (2) in the form 
2cosh? a — 1 = cosh 2b; (3) 


and if we then substitute for cosha from equation (1) into equation (3), 
we obtain 


cosh 2b 
cosh 2d 


We may rearrange this equation in the form 
(co ay 7 2 
cosh2b) ~— 1+ cosh? 2b’ 
/2 cosh 2b 


(1 + cosh? 2b) 
xf 


1 
(1 + cosh? =) 
< V2. 


It follows that d < 5 cosh7! (v2) ~ 0.4407, as claimed. 


2 
) — 1 =cosh? 2b. 


so that 


cosh 2d = 


Pythagoras’ Theorem takes a simple, if unexpected, form in hyperbolic 
geometry. Moreover, the relationship closely approximates the Euclidean one 
if the sides of the triangle are very small, which is what you would expect 
because small hyperbolic triangles have angle sums nearly equal to z and so 
are nearly Euclidean themselves. To see the relationship, we expand the cosh 
terms as power series, using the formula 

(2c)? 


cosh2c = 1+ 71 


+ higher powers of 2c. 


If we ignore the higher powers of 2c, as we may when 2c is very small, 
we find 
cosh 2c ~ 1 + 2c* 
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Here we use the identity 
cosh? 2x = 2cosh? x — 1, 
forx eR. 


Recall that 


ay x2 x4 
coshx = Fag tae ee 


for allx eR. 
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and 
cosh 2a x cosh 2b = (1 + 2a”) (1 + 2b”) 
= 1+ 2a? + 2b* + 4a’b’. 
If we ignore the term 4a*b on the grounds that it is as small as other terms we 
have already dropped, then we are left with 
142c? ~ 1+42a* + 2b’, 
which reduces to 
ora’ +2, 

So for ‘small’ d-triangles the hyperbolic version of Pythagoras’ Theorem is 

essentially the same as the Euclidean version. 


In Euclidean geometry there is a simple connection between the lengths of 
the sides of a right-angled triangle and its angles. An analogous result exists in 


hyperbolic geometry also. 


Theorem 9 Lobachevskii’s Formula 
Let the d-triangle AABC have a right angle at C, and let the hyperbolic 
lengths of sides AC and BC be b and a, respectively. Then 


ee tanh 2a We prove this result later 
eae sinh 2b° in the subsection. 


For example, let the points A, B,C be 7 xi and 0, respectively. Then it 


follows from Lobachevskii’s Formula that 


Rl 


tanh 2d 


sinh 2d 


2 
2 
S 


2 ——_ & 0 
1.6667 
so that A ~ tan—! 0.6 ~ 0.5404 radians. 


Remark 

It can be shown that when the triangle is very small, and so nearly Euclidean, 
tanh2a is approximately 2a and sinh2b is approximately 2b, so that 
tan A ~ a/b, which is the Euclidean value. 
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Problem 9 Determine the angle at A in the d-triangle AABC with 
3 


vertices — zi : -3 and 0, respectively. 

We can also use these ideas to answer another question that loomed very 
large in the discovery of hyperbolic geometry. 

Let £’ be the d-line from a point P that is perpendicular to a d-line ¢ not 
through P, and let p be the length of the segment of ¢’ from P to £. What is The angle ¢ is called ‘the 
the angle gy between ¢’ and a d-line through P which is parallel to £? angle of parallelism’. 

Let Q be the foot of the perpendicular from P to £, and R a point on ¢ dif- 
ferent from Q. As the point R moves away from Q towards @, the hyperbolic 
length of QR, d(Q, R), tends to infinity; it follows that tanh 2d(Q, R) tends to 
1 as R approaches @. 

From Lobachevskii’s Formula, 

tanh 2d(Q, R) 


tan ZOPR = ——————__, 
sinh 2p 


and so, if we let R approach @ along £, it follows that 


1 
sinh2p° 


tang = 


We have proved the following result. 


Corollary 1 Angle of Parallelism 
Let € be ad-line, and P a point of J that does not lie on £. Then the angle g 
between the perpendicular from P to ¢ (of hyperbolic length p) and either 


d-line through P that is parallel to @ is given by Qe? 
($2\ 
tang = — ' 1 a 1 
-— ay, 


For example, the hyperbolic distance of the point si from the diameter 
£ = (-1,1) is tanh! 0.5 ~ 0.5493. It follows that the angle y between the 
perpendicular from si to £ and either parallel to @ through si is given by 


(0, $i)=p 


1 In fact, the remarkable 
tang = — f ] 
sinh (2 tanh! 0.5) pet 
2 
1 sinh(2 tanh 1x) = 
1— x2 


~ sinh 1.0986’ 


gives tang = ; exactly. 


hence, g ~ 0.6435 radians. 


Problem 10 Determine the angle between the perpendicular from the 
point 3 to the d-line @ with equation x = 0 and either parallel to ¢ 
through 3. 


Finally we give, without proof, another useful formula. 
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Theorem 10 Sine Formula 
Let AABC be a d-triangle right-angled at C. Let a and c be the hyperbolic — A proof of this result will 


lengths of BC and AB. Then be found as an Exercise in 
Section 6.7. 
sinh 2a 


sin A = — : 
sinh 2c 


Remark 


This result agrees with the expected one when the triangle is small, for in such 
cases sinh 2a ~ 2a and sinh 2c ~ 2c, so sin A ~ a/c, which is the Euclidean 


angle A 


result. 


Problem 11 Use the Sine Formula to find the length a of the sides of 
a hyperbolic equilateral triangle AABC with angle 0, as a function of 0. 
Hint: Apply the Sine Formula to the triangle obtained by dropping the 
perpendicular AD from A to BC. 


Proofs 


We now supply proofs of two results that you met earlier in this subsection. 
The formulas in Theorems 8 and 9 depend on the following remarkable 
formula: 
14x 
= — (4) 


cosh (2 tanh! x) = 
1l-x 


The substitution x = tanht shows that equation (4) holds if and only if 


1 + tanh? ¢ 
cosh 2 = —————_,, 
1 — tanh? ¢ 
and this formula follows from the definitions of cosh and tanh, as you can 


check. 
For future convenience, we now list various expressions that we have now 
met for the inverse tanh function. 


For x € (—1, 1), the following are all equal to tanh! x: 


1 Ware Wy ee 1 
0 5 = Si ; : 
ene Tes Tae 


Theorem 8 Pythagoras’ Theorem 
Let AABC be a d-triangle in which the angle at C is a right angle. If a,b 
and c are the hyperbolic lengths of BC, CA and AB, then 


cosh 2c = cosh 2a x cosh 2b. 
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Proof ‘To prove Pythagoras’ Theorem in hyperbolic geometry, by the Origin 
Lemma we may assume that C is at the centre of Y, the point A is at the point 
a’ on the horizontal diameter and the point B is at ib’ on the vertical diameter, 
where a’ and b’ are real and positive. 


Here a, b,c are real 
because they are the 
hyperbolic lengths of the 
sides of the d-triangle 
AABC. 


From the Distance Formula, it follows that 
a=tanh!b’ and b=tanh7! a’. (5) 


We now map JY to itself by the hyperbolic transformation 


z-a’ 
M(z) = = 


Under M, A goes to the origin (which we shall also call A’, for clarity), B goes 
to the point B’, with complex coordinates 
if = ib’ —a' 
1 —ia'b”’ 
and C goes to the point C’, with coordinates —a’. 
Because the hyperbolic transformation preserves lengths, the hyperbolic 
length c of AB is equal to the hyperbolic length of A’ B’. 


To find c, the hyperbolic length of AB, we first find the modulus of b”. 
We calculate its square: 


we 2 lib =a’? _ a + b? 6 
|1 — ia'b’|? 1+a2pb2° 


Now we observe from formula (4) and equation (5) above that 


1 p2 
cosh 2a = cosh (2 tanh”! b’) eae , 
1—b2 
1 12 
cosh 2b = cosh (2 tanh! a’) = ee 
1—a? 


also, since c = tanh™! |b”’|, we get from the formula (4) that 


1+ |b"? 
cosh 2c = cosh (2tanh-! 1b") = Paley 
1— bP 
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But, it then follows from equation (6) that 


(2 512 

1+ [2 1+ tap 
cosh 2c = = 

_ |b’ |2 1 a’2+b’2 

~~ Ja q2pr 


lta?24+b2 +a?b?2 
= 1 —q2 — b2 + q'2b/2 
14+ b2 l+a” 
~ Tb? * Ta?’ 


cosh (2 tanh7! v) x cosh (2 tanh! a’) 


= cosh 2a x cosh 2b. 


This proves the analogue of Pythagoras’ Theorem in_ hyperbolic 
geometry. | 


Theorem 9 Lobachevskii’s Formula 
Let the d-triangle AABC have a right angle at C, and let the hyperbolic 
lengths of sides AC and BC be b and a, respectively. Then 


7 tanh 2a 


tan A = — : 
sinh 2b 


Proof We proceed as in the proof of Theorem 8, by mapping the 
d-triangle AABC with vertices at a’,ib’ and 0, respectively, by the hyperbolic 


transformation M(z) = oe onto the d-triangle with vertices at 0, 


We obtain the second 


ib’—a’ — —a’ (1+ b”) + ib’ (1-a”) 


p= _ (7) fraction in equation (7) by 
1 —ia’b’ l+a?b? multiplying the numerator 
and denominator of the 
and —a’, respectively, where b = tanh™! a’ and a = tanh! D’, so that first fraction by 1 + ia‘b’. 


a’ =tanhb and b’ =tanha. 


The mapping also sends the angle A in AABC onto an angle of equal size 3’. »” angle A 
(which we also denote by A by a slight abuse of notation) at the origin. Since ta a 

the triangle with vertices at 0,b” and —a’ is right-angled at —a’, it follows © © A’ 
from equation (7) that a 
b’ (1 = a’) 


A> 3 
tan a’ (1 +62) 


(8) 
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Now, since a’ = tanh b, we can write 


l-a?  1-—tanh?b 
a’ = tanh b 
cosh? b — sinh’ b 2 
~ “sinhb-coshb sinh 2b 
Similarly, we can verify that 
b 
1+b? 


_ 1 
=35 tanh 2a. 


: : . =e y : 
Hence, if we substitute these expressions for Ls and en into equa- 


tion (8), we find that 


_ tanh 2a 
~~ sinh 2b" 


6.4.4 Equidistant Curves to d-Lines 


In the Euclidean plane R?, the set of points at a fixed distance from a given 
line @ is a pair of straight lines parallel to £. In the hyperbolic disc J there are 
two curves equidistant from a given d-line, though neither is itself a d-line. 


Theorem 11 Let ¢ be ad-line that ends at points A and B on @, and let £’ 
be that part in Z of a (Euclidean) circle through A and B, and let w be the 
angle (in Z) between ¢ and ¢’. Then all points on ¢’ are the same hyperbolic 
distance d from ¢, where 


d= }log, (tan($ + 3)}. 


Proof First, map @ and J to themselves by a Mobius transformation M that 
maps A and B to —1 and 1 (not necessarily respectively) and maps ¢’ to a 
curve in the upper half of %. Since Mobius transformations preserve angles 
and generalized circles, it follows that the image of the d-line AB is the diam- 
eter [—1, 1], and the image of ¢’ is an arc of a circle in the upper half of F 
through +1 making an angle a (in Y) with [—1, 1]. 
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Here we use the identities 
cosh? x — sinh? x = 1 
and 
sinh 2x = 2 sinh x coshx 


forx € R. 


It is not hard to verify that 
such an M exists. 
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In view of the above remarks, it is sufficient to assume that the d-line € Similarly, we shall refer 
is the diameter (—1,1) of Y. Let p be a point on £’, and m the foot of — below to ¢’ rather than to 


the perpendicular r from p to (—1, 1). Let M; be the Mobius transformation M(e’). 
defined by 
Mi(z) = ——. 
1—mz 

Then M, maps JY to itself, and (—1, 1) to itself. It maps m to 0, é’ to an arc 
of a (Euclidean) circle, with radius R and centre C (say), through +1; and it 
maps r to a portion of a d-line through O that is perpendicular to (—1, 1)— 
and so to a portion of the imaginary axis. It follows that the point P = M,(p) 
is of the form ai, for some real number a € (—1, 1). Since M is a hyperbolic 
transformation it does not alter hyperbolic distances, so that Property 6, Subsection 

6.3.1 


d(p,m) = d(0,ai) = d(0,a) 
= d, say. 


Now, if Q is the point where the perpendicular to CB at B meets OC, the 
triangles ACBQ and ABO@Q are similar, since ZCBQ = 5 = ZBOQ and 
ZBCQ = ZOQB. It follows that ZBCQ = ZOBQ = a. Also, from AOBC we 


see that R sina = OB = 1. Hence Here we use the identities 


a= R—Rcosa cos 2x = 1 —2sin? x 
_ 1 — cosa and 
sina 
9 sin2 ) sin2x = 2sinx cos x 
~ 2sin (%) cos (%) = tan (5). forxéR 
2 2 . 
It follows that Recall from equation (3) 
of Subsection 6.3.1 that 
a 
d = tanh~! a = tanh! (tan (5)) ere ee (4) 
an = 5log, {| —— 
1-k 
Ly ( I+ tan | 
= 7 108 for any k > 0. 
2 e YK = 
1 — tan ($) 
1 
= 3 log, (tan ($ + 7). 
This completes the proof of the theorem. a 


For example, the portion in the upper half of Y of the Euclidean circle 
through -+1 that makes an angle of 2/4 with the segment [—1, 1] is a set of 
points equidistant in Y from [—1, 1], with the distance of separation being 


d= 5108, (tan (= + *)) — 5 log, («an (=) ~ 0.4407. 
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6.5 Area 


6.5.1 Area ofa d-triangle 


In the Euclidean plane R2, the area of a set is defined in terms of the union 
of areas of small rectangles. In the hyperbolic disc Y the situation is some- 
what different; d-quadrilaterals cannot have four right angles and so do not 
conveniently exhaust areas in an analogous way. 

We want our definition of area in hyperbolic geometry to have such 
properties as the following: 


e The area of a d-triangle is non-negative, and is zero only if its Euclidean 
area iS Zero; 

e d-congruent d-triangles (and d-congruent asymptotic d-triangles) should 

have the same area; 

if one d-triangle can be fitted inside another, it should have smaller area; 


e area should be additive. 


Then we can determine the area of any figure in Y that can be divided up into 
d-triangles. 

Throughout this subsection, all ‘lines’ and ‘line segments’ will be d-lines and 
d-line segments, but to avoid constant repetition we will simply drop the ‘d’. 


Notice first that as d-triangles grow in size their angle sum decreases. For 
example, let AABC be contained in AAB’C, where A, B and B’ are collinear, 
as shown. Denote by a, 8, y and a, f’, y’ the angles of the two d-triangles, 
as shown. Consider the d-triangle ABC B’. The angles at B and C in this are 
x — B and y’ — y. Then, since the sum of the angles of a d-triangle is less than 
zr, it follows that 


(a —B)+(y'—y) +6 <Z, 
so thatB +y > B’+y';hencea+ B+y >a+ fp’ + y’, as asserted. 


Problem 1 Prove that a trebly asymptotic d-triangle can be divided 
into two doubly asymptotic right-angled d-triangles. 

Hint: Consider a new vertex sliding along one edge; how do the angles 
vary? 


Now, since ‘very small’ d-triangles are d-congruent to small d-triangles near 
the origin, where d-lines are ‘approximately Euclidean’ lines, the sum of the 
angles of “very small’ d-triangles is close to z. 

At the opposite end of the size scale, we can use the result of Problem 1 
above to prove the surprising fact that all trebly asymptotic d-triangles are 
d-congruent to each other. 


Theorem 1 All trebly asymptotic d-triangles are d-congruent to each 
other. 
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Theorem 2, Subsection 
6.4.1 


Theorem 1, Subsection 
6.4.1 


Theorem | below 
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Proof It is sufficient to prove that trebly asymptotic d-triangles are 
d-congruent to the trebly asymptotic d-triangle with vertices —1, 1 andi. 

So, let AABC be any trebly asymptotic d-triangle, where (for convenience) 
we assume that the vertices A, B, C occur in clockwise order on @. Let £ denote 
the line AB, and let D be any point of @ in J. 

Then, by the Origin Lemma, we can map Y to itself and D onto the origin 
O by a Mobius transformation, M, say. It follows that M; maps AB onto a 
diameter of J. 


A - i 
YX Oy, 4 7 
\ \ ee. 
¥ See eee Se 
M, 6 M, 6 M; 6 


We can then map % to itself by another Mobius transformation, M2 say, that 
rotates the disc about the origin and maps A and B onto —1 and 1, respectively. 
Since Mobius transformations are direct hyperbolic transformations, they pre- 
serve the orientation of points; it follows that the image of C under M2 o M, is 
some point p, say, on the upper half of @. 

Now, for any real number m € (—1, 1), the Mébius transformation 


Z—m 
M3(z) = 


1 —mz 
maps YZ to J, © to@, —1 to —1, 1 to 1, and the upper halves of Z and @ to 
themselves. It also maps p toi if m is chosen so that 


_ p-m 
i = ——_.,, 
1—mp 
which we can rewrite in the form 
i 
ee am (1) 
1 — pi 


Then, using the facts that |p| = 1 andIm p > 0, we can check that equation (1) | We omit the details. 
indeed provides a value of m that is real and strictly between —1 and 1. 
Composing the three Mobius transformations that we have met, the compos- 
ite Mébius transformation M3 0 M2 o M,; maps the trebly asymptotic d-triangle 
AABC onto the trebly asymptotic d-triangle with vertices —1, 1 and i, as 
required. | 


Trebly asymptotic d-triangles will play a crucial role in our discussion of 
hyperbolic area. First, we see that every d-triangle can be fitted inside some 
trebly asymptotic d-triangle. 
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Theorem 2 Let AABC be a d-triangle. Then there exists a trebly asymp- 
totic d-triangle ADEF that contains AABC. 


Proof First, we construct a trebly asymptotic d-triangle ADEF as follows: 
extend the segment AB beyond B to meet @ at D, the segment BC beyond C 
to meet @ at E, and the segment CA beyond A to meet @ at F. Then draw the 
trebly asymptotic d-triangle ADEF. 

Now, the d-line containing the points A, B, D cuts off a minor arc of @ that 
contains F’. It follows that the d-line through D and F lies on that side of 
the d-line through A, B, D, and (in particular) that the d-triangles ADAF and 
AABC do not overlap. 

Similarly, the d-triangles AEBD and AABC do not overlap, and also the 
d-triangles AFCE and AABC do not overlap. Also, none of the d-triangles 
ADAF, AEBD, AFCE overlap. 

It follows that AABC is contained within ADEF, as required. | 


Then, since every d-triangle can be fitted inside some trebly asymptotic 
d-triangle, the following completely surprising result (which holds whatever 
our definition of hyperbolic area is) proves the existence of an upper bound for 
the area of all d-triangles. 


Theorem 3 All trebly asymptotic d-triangles have the same (hyperbolic) 
area, and this is a finite number. 


Proof Using the result of Problem 1, we can divide a given trebly asymp- 
totic d-triangle into six singly asymptotic d-triangles, as shown in the margin 
diagram. Therefore it is sufficient to prove that the area of a singly asymptotic 
d-triangle is finite. 

Let AABC be a singly-asymptotic d-triangle, with C on @ (as in the left 
figure below). First, extend AB to meet @ at a point C’, and bisect ZBAC by 
AD, where D is a point on CC’. Next, (hyperbolically) reflect ZBAC in AD; 
since AB has finite hyperbolic length and AC has infinite hyperbolic length 
(since C lies on @), this reflection maps B to some point Aj, say, on AC, 
as shown in the right diagram below. Also, the reflection must map BC onto 
A\C’; and, by symmetry, BC and A;C’ must meet at some point, By say, 
on AD. 

Next, bisect ZC’ BC by BDo, where Do is on C’'C, and bisect ZC’ AC by 
A,Dy,, where D, is on C’C, as shown below. 
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Note that the area must be 
finite, whatever definition 
of hyperbolic area we 
choose to use. 
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Our strategy of proof is now as follows. The d-pentagon AA; D; DoB and 
its boundary all lie in Y, and so at a minimum positive distance from @, the 
boundary of Y. Also, by adding additional line segments B; Do and B; Di we 
can divide the d-pentagon into eight d-triangles; each d-triangle has finite area, 
and so the d-pentagon must also have finite area. We will show that ABAC 
can be cut up and reassembled (iteratively) inside this d-pentagon, and so has 
an area less than or equal to the area of the d-pentagon; in particular, it must 
have finite area. 


We now do to AB, AiC what we did previously to ABAC. Reflect (in a 
hyperbolic way) AB, AC in A; D, : B, maps to some point Az on A;C and 
B,C maps to A2C’, which meets B,C at some point Bz on A, Dj. 

AB2Aj;Az2 is congruent to several d-triangles in the d-pentagon; for con- 
venience we choose the black one on the left in the figure above. We then 
repeat the process for AB) A2C. We bisect AB2A2C with A2D2, where D2 
is some point on C’C. The cross-hatched d-triangles in the above figure are 
d-congruent — one is in ABAC and one is in the d-pentagon. 

Continuing indefinitely, we do for AB,+1Ay+1C what we have just done for 
AB, AnC. With each pair of hyperbolic reflections we get two new d-triangles 
in ABAC and two new d-triangles in the d-pentagon. These d-triangles are 
d-congruent by means of a sequence of hyperbolic reflections in the d-lines 
An+1Dn41; AnDny, Se AD, and AD. 

In this way, we reassemble the asymptotic d-triangle inside the d-pentagon; 
hence the asymptotic d-triangle must have finite area. 

It follows that all trebly asymptotic d-triangles have finite area, and so 
(as we noted before the theorem that all trebly asymptotic d-triangles are 
d-congruent) they have the same area. | 


Problem 2 


(a) Prove that a doubly asymptotic d-triangle is a subset of a suitably 
chosen trebly asymptotic d-triangle, and so is finite. 

(b) Prove that the area of a doubly asymptotic d-triangle depends only 
on the angle at its vertex in J. 
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We can now give Gauss’ magnificent proof that the area of each d-triangle 
is proportional to the quantity FE 


mz — (angle sum). 


Theorem4 The area of ad-triangle with angles a, 6, y is given by K (x — Ws 
(a + 6+ y)), where K is the same constant for all d-triangles. 


D E 
Proof Let AABC be a d-triangle with angles a, B,y at vertices A, B,C, ce 


respectively. Extend the line segment AB to a point D on @, the segment BC to 
a point E on @, and the segment CA to a point F on @. Since ADEF is trebly 
asymptotic, it has finite area, k say, by Theorem 3. 

Now, we have seen that any doubly asymptotic d-triangle has an area which 
depends only on the angle at its vertex in Y, and so only on the ‘exterior angle’ Problem 2, part (b) 
of the triangle at its vertex in Z. So, we define a function f such that the area 
of any doubly asymptotic d-triangle with exterior angle 0, say, in is f (0). 

Then, it is clear from the margin figure above that in AFAD this exterior 
angle equals a; so, the (hyperbolic) area of AFAD is f(a). 

If the segment FAC extended meets @ at X (as in the left figure below), then 
ADXF is trebly asymptotic and is composed of the two doubly asymptotic 
d-triangles AFAD and ADAX. Then, if we denote the area of each trebly- 
asymptotic d-triangle (such as ADXF) by k, say, and notice that (from our 
definition of f) the area of ADAX is f(z — @) since its exterior angle at A is 
az — a, it follows that 


k= f(a)+ fi —a@). (2) Here we use the fact that 
Our next objective is to use equation (2) to determine an explicit formula for f. io ag ated ts 
additive. 
F 
D 
E 
X 


So, consider any trebly asymptotic d-triangle, as shown in the figure on the 
right above. By drawing radii from the center of J to its vertices on @, we can 
divide this d-triangle into three doubly asymptotic d-triangles. If we denote 
the angles at O of these three d-triangles by 7 —a, m — 6 anda+ f, as shown 
above, we have 


k= f(a) + f(B) + fir — (a+ B)) 
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By equation (2), f(a — (a+ B)) =k — f(a + B), so that 
f@)+ f(B)+k— flat B) =k, 
that is, 
f(a) + f(B) = flat B). (3) 


It follows from equation (3) that f is linear, so that f(@) = Aa for some 
constant A. 
Using this formula for f, we deduce from equation (2) that 


Ma)+rA(m7 —a) =k, 


so that A = A. Hence f(a) = Ka. 

Returning to the original d-triangle AABC in the margin figure above, the 
area of AABC is equal to the area of ADEF minus the sum of the areas of 
the three doubly asymptotic d-triangles AFAD, ADBE and ACAF. Since the 
latter four areas arek, f(a), f(B) and f(y), respectively, it follows that 


area of AABC = k — (f(a) + f(B) + f(y)). 


Using the formula for f that we have just found, we can write this equation as 


k k k 
area of AABC = k a+—B+—-y], 
IT T 1 


and so as 
area of AABC = K(x — (a+ f+y)), 


where K = k/r. |_| 


It is usual to take K = | (which is equivalent to taking the area of a trebly 
asymptotic d-triangle to be 7). Then 


areaof AABC =z —-(a+6+y). 


Now this result is all very well, but how do we know whether, given any pos- 
itive real numbers a, 6, y witha + 6+ y < 7, there actually is a d-triangle 
whose angles are a, 8, y? 

A key tool in showing that such a d-triangle exists, is the following result. 


Lemma 1 Let P be a point on a d-line @ of Y, and let 0 < B < a Then 
there are exactly two d-lines through P that make an acute angle 6 with @, 
and each is a (hyperbolic) reflection of the other in @. 


Proof Since all the properties in the statement of the Lemma are hyperbolic 
properties, and there is a hyperbolic transformation (for example, a Mobius 
transformation) that maps the point P to the origin O, it is sufficient to prove 
the result when P is the origin. 
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We omit the details of this 
fact, which uses a 
continuity argument. 


We again use the fact that 
(hyperbolic) area is 
additive. 


Note that here K is 
arbitrary. 


The expression 
mz—(a+f6+y) is called 
the angular defect of 
AABC. 


Recall that, if a, 6, y are 
positive real numbers with 
a+ 6+y =7, then such 
a Euclidean triangle 
exists. 
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Then there are (by Euclidean geometry) exactly two diameters of Y (and 
diameters are d-lines) through the origin that make an acute angle 6 with the 
given diameter @, and each is a reflection of the other in €. This completes the 
proof. | 


We are now in a position to prove the final result in this subsection. 


Theorem 5 Leta, 6, y be positive real numbers withO <a+f+y <7. 
Then there is a d-triangle in Z with angles a, B, y. 


Proof Since0 <a+6+y < 7, at least one of the numbers lies strictly 
between 0 and ae let us suppose that this number is 6. Next, let A, P Q be the 
points in ZY with coordinates 0, 1, cosa +i sina , and let £ be a variable d-line 
in Y whose initial position is as the d-line with endpoints P and Q. Then the 
angles of the doubly-asymptotic d-triangle AAPQ are a, 0 and 0. 

The first step is to keep fixed the endpoint of ¢ that is originally at Q, and 
let the endpoint that is originally at P move round @ away from Q. As this 
endpoint moves away from P, the angle @ between ¢ and the radius AP varies 
continuously, starting at 0. By the time that the moving endpoint reaches Q’, 
the image of Q under Euclidean reflection in the radius AP, the angle 6 has 
reached the value 4. It follows, by continuity and the intermediate value prop- 
erty, that there is some point, R say, on AP such that for the d-line through P 
and R the corresponding value of the angle @ is f. 

Now, it follows from Lemma | that through every point S between A and R 
there is a d-line that makes an angle # with the radius AP; also, if we assume 
that this angle 6 lies above the radius AP and contains part of the second 
quadrant of Z in its interior, this d-line is unique. 

We now let S move along AP from R towards A, taking with it the corre- 
sponding d-line £. Denote by T the point where the variable d-line ¢ intersects 
the radius AQ. Then, as S moves along AP from R towards A, T moves along 
AQ from Q towards A. 

The d-lines £; and £2 corresponding to successive positions S$; and Sz of S 
do not meet in Y. For, if they met at a point X in JY, then the external angle 
at Sz of the d-triangle AXS,S2 would be less than or equal to the sum of the 
opposite two internal angles, which is impossible by the result of Problem | in 
Subsection 6.4.1. (The following figure shows the situation when X is in the 
upper/lower half of Z, respectively.) 


A = >R 


>R X 
S5 Sy 
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Hence as S moves monotonically along AP from R towards A, T moves 
monotonically along AQ from Q towards A. As S approaches A, the curvature 
of the d-line € decreases, and £ approaches the diameter d of Y that makes an 
angle 6 with the diameter containing AP. 

As T moves monotonically along AQ from Q towards A, the angle at T 
between the radial line AT and @ starts at 0, and approaches the value z — 
(a + B) as T approaches A. It follows, by continuity and the intermediate 
value property, that there is some point, C say, on AQ such that at C the d-line 
£ makes an angle y with the radius through C. 

If we denote by B the point where the d-line € through C meets AP, then 
the triangle AABC has the desired angles a, B, y. |_| 


6.5.2 ‘Tessellations 


The term tessellate means ‘fit together exactly a number of identical shapes, 
leaving no spaces’, as in the making of a mosaic. For example, we may tes- 
sellate the Euclidean plane using rectangles of a given size in several ways, 
and we may tessellate it in other ways as well — as illustrated below. In these 
examples, we may move a given occurrence of any shape onto any other occur- 


; ; ee ; You saw in Theorem 2 of 
rence by a Euclidean transformation, and so by a combination of Euclidean — sybsection 5.2.1 that 


reflections. every Euclidean 


transformation is a 
composite of reflections. 
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Euclidean tessellations using a single figure are sometimes called ‘wallpaper _ Patterns like those in the 
figure above are found 
among the tilings of the 
: . Alhambra Palace in 

17 different wallpaper patterns, in the sense that the symmetry group of any — Granada (Spain), and 


wallpaper pattern is isomorphic to one of only 17 non-isomorphic groups. elsewhere in Moorish art. 


patterns’, for obvious reasons! With each such figure we can associate its group 
of symmetries; it is an interesting piece of mathematics that there are only 


Area 


Tessellations occur in hyperbolic geometry also. For example, we may start 
with a symmetric figure, a symmetric d-triangle AABC, in Y bounded by 
three d-lines (as in the following figure), reflect AABC in each d-line in turn 
to obtain 3 copies each d-congruent to AABC, and so on. This produces a 
hyperbolic tessellation of Y, as shown in the figure below; for clarity we have 
shaded all the copies that can be obtained from AABC by direct hyperbolic 
transformations. All the copies are d-congruent to each other, even though to 
a Euclidean eye they seem to be of various different shapes and sizes! 


Another tessellation of Z was suggested to the graphic artist M. C. Escher 
by the geometer H. S. M. Coxeter, and resulted in the picture below, Circle 
Limit IV. The angels and devils rest on a tessellation of Z by d-triangles. 


6.5.3 Kaleidoscopes 


We now explore the ideas underlying the (Euclidean) kaleidoscope, and 
explain how hyperbolic tessellations can be constructed. 
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Harold Scott MacDonald 
Coxeter (1907-2003) was 
born in London, guided 
towards mathematics by 
Bertrand Russell, and 
spent most of his life in 
Canada. His work on 
regular polytopes led to 
the discovery of Carbon 
60, and to the famous 
geodesic dome of 
Buckminster Fuller. 


Maurits Cornelis Escher 
(1898-1972) received his 
graphic art training in 
Arnhem and Haarlem, and 
worked in Italy, 
Switzerland, Belgium and 
Holland. 

M. C. Escher’s ‘Circle 
Limit IV’ © 2010. 

The M. C. Escher 
Company-Holland. All 
rights reserved. 
www.mcescher.com 


A kaleidoscope is a 
machine that produces 
wallpaper patterns using 
reflections in mirrors. 
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A Euclidean Kaleidoscope 


If two mirrors meet at right angles, an angle of 2/2, any point between the 
mirrors has an image in each mirror; then, each image itself has an image in 
the mirrors — giving a total of four points in all, lying on a circle centred at the 
vertex where the mirrors meet. 

If the mirrors meet at an angle of 2/3, there are 6 points in all; and, in 
general, if the mirrors meet at an angle z/k there are 2k points in all. 


e 
initial point 


initial point 
e 


A real kaleidoscope has three mirrors, each pair meeting at an angle z/k, 
where k is some integer. For example, consider the case where the mirrors 
meet at angles of 2/2, 2/3 and 2/6. Corresponding to these vertices there are 
4 points, 6 points and 12 points — making a total (in the first instance) of 17 
points. But each of these points in turn has images in the mirrors, and soon..... The total number is 17 
The whole pattern of images spreads through the plane! rather than 22, since some 


of them are repeated. 


initial point 


Area 


Since reflection is a Euclidean isometry, the process of reflections fills the 
whole plane with congruent shapes. 


A ‘Hyperbolic Kaleidoscope’ 


Hyperbolic reflections and transformations are hyperbolic isometries; that is, 
they map figures without altering the hyperbolic distances between their points 
even though the images may look (to a Euclidean eye) very different in size 
and shape from the originals. In particular, they map d-triangles to d-congruent 
d-triangles. 

If two mirrors along d-lines meet at an angle of 2/2, any point between the 
mirrors has an image under hyperbolic reflection in each mirror; then, each 
image itself has an image in the mirrors. Again, we obtain a total of four points 
in all, lying on a hyperbolic circle centred at the vertex. 

Similarly, if the mirrors along d-lines meet at an angle of 7/3, there are 6 
points in all; and, in general, if the mirrors meet at an angle of 2/k, there are 
2k points in all. 


e 
initial point 


initial point 


initial triangle | 


e 


Now construct a hyperbolic analogue of the Euclidean kaleidoscope, with 
three mirrors along d-lines in Y: two of them lying along d-lines through the 
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origin that meet at O at an angle of 2/7, and the third lying along a d-line that 
meets the first two at angles of 2/2 and 7/3. We reflect the triangle in each 
side in turn (using hyperbolic reflections), and continue to reflect indefinitely 
in the three d-lines. 

Since hyperbolic reflection is a hyperbolic isometry, the process of reflec- 
tions fills the whole unit disc with d-congruent d-triangles. The figure that we 
end up with is just the design on the front of the book and in Subsection 6.4.1. 


6.6 Hyperbolic Geometry: the Half-Plane Model 


In previous sections we described the disc model of hyperbolic geometry, in 

which the unit disc Y is the space of points, and the transformations of this 

space as being of the form 

ze M(zZ)= ai or ze M(Z)= = = 
bz+a bz+a 

However there is a completely equivalent hyperbolic geometry, in which the 


space (the set of points) is the upper half-plane 
H={z:zeEC, 

and the h-lines in H (the analogues of d-lines in Y) are the restriction to H 
of vertical lines in the plane together with the restriction to H of circles with 
centre on the real axis in C. Notice that the h-lines are all arcs of generalized 
circles that meet the boundary of H at right angles, just as d-lines are all arcs 
of generalized circles that meet the boundary @ of F at right angles. 
The Mobius transformation z +> i i maps Y one-one onto H, and pro- 
vides a convenient method for ‘transferring’ the geometric properties in J to 
corresponding properties in H. For example, it maps the d-lines in F to the 
h-lines in H. 

The hyperbolic transformations in H are the mappings of H to itself 
given by 


where |b| < |a|. 


Im z > 0}, 


az+b 
cz+d 
where a,b,c and d are real, and ad — bc > 0. These are the composites 
of reflections in h-lines, which are just inversions in the Euclidean circles of 
which the /-lines form a part. 
The geometry in H has many properties similar to those in %: 


zt M(z)= 


e the sum of the angles of an h-triangle is less than zr; 

e two h-triangles are h-congruent if their angles are equal in pairs; 

e the area of an A-triangle with angles a, 6, y is K(m7 —(a+f+y)), for some 
K>0; 

e h-circles are Euclidean circles in H, and vice-versa. 


Infinitesimal hyperbolic distances ds in H are related to their Euclidean 
lengths by the formula ds = az , from which it can be shown that geodesics 


in A are the /-lines. 
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It is not hard to verify that 
such a d-line must exist. 


Note that 
Iv IU us 
i ase ae 


That is, there is a one—one 
correspondence between 
points of the two spaces 
and the transformations in 
the two geometries. 


Generally we take K = 1, 
as in Subsection 6.5.1. 


Recall that a geodesic is 
the curve of shortest 
length joining two points 
in the space. 
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Finally, we comment that the hyperbolic geometry of H turns out to be 
of great importance in many unexpected branches of mathematics — such as 
Number Theory, in which tessellations of H such as the following occur 
naturally. 


6.7. Exercises 


Section 6.1 
1. Sketch the following d-lines: 


€)={@, y):x+y=0}NF; 
f= {yi +y? + 2W2y +1 =0} 09; 


=|: +y?-V5y +1 =0n9; 
ty={@y) ix? +y?—3x +1 =0}ng, 


Hence decide which of the d-lines intersect, which are parallel and which 
are ultra-parallel. 

2. Sketch three d-lines €;, 22 and £3 with the following properties: €; and £3 
are parallel, £2 intersects €; and £3, and the angles at which £2 crosses ¢; 
and £3 are not equal. 

(This shows that the Corresponding Angles Theorem in Euclidean geom- 
etry does not hold in hyperbolic geometry.) 

3. Verify that the points (- we +3) and (- wo - +3) lie on the circle C with 


equation 


x+y? 42/2x+1=0. 
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Hence write down the equations of the diameters of Z which are parallel to 
the d-line € which is part of C. 

4. Use the Origin Lemma to show that, given two d-lines meeting at right 
angles, there is a hyperbolic reflection mapping them to two perpendicular 
diameters of J. 

5. Let €1, €2 and £3 be any three d-lines such that hyperbolic reflection in any 
one exchanges the other two. 

(a) Prove that there is some point P of Z which lies on each of the three 
d-lines. 

(b) Use the Origin Lemma to prove that the acute angle between any two 
of the d-lines is 3. 

6. Verify that the following equations are the equations of generalized circles 
of which d-lines are a part, and sketch the corresponding d-lines. 
(a) x*>+y?+4x+1=0 
(b) x27 + y?+2x —2y+1=0 

7. Use the fact that the circle with radius a and centre (1,a), where a > 0, 
touches the x-axis to write down the equation of a d-line through the point 
xi that is parallel to the d-line with equation y = 0. 


Section 6.2 


1. Determine the general form of the direct hyperbolic transformations that 
map 0 to 0 and the diameter y = x of F to the diameter y = 0 of %. 
2. (a) Determine the general form of the direct hyperbolic transformations 
that map each of the following points to 0: 


(b) Determine a direct hyperbolic transformation that maps -5 to 5 - zi . 


(c) Determine the general form of the direct hyperbolic transformation that 
maps a xi to —3. 

3. By drawing a sketch, or in some other way, decide whether the effect 
of successive hyperbolic reflections in d-lines €; and then €2 given by 
the following equations is a (hyperbolic) rotation, a limit rotation or a 
translation. 

(a) €): x7 +y? —2x —2y+1=0; £29: 47+ y?+2x4+2y4+1=0. 
(b) 2, : x? + y* —2x —2y+1=0; £9: 47 + y*-—2x4+2y4+1=0. 

4. Let M be a direct hyperbolic transformation mapping the point p in J to 
the point g. Let M, be a direct hyperbolic transformation mapping the point 
q to 0. 

(a) Show that the composite transformation M’ = M, o M maps p to 0. 

(b) Deduce that M is the composite of a hyperbolic transformation 
mapping p to O followed by a hyperbolic transformation mapping 
O tog. 
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5. (a) Prove that the set of all direct hyperbolic transformations forms a group 
under the operation of composition of functions. 
(b) Prove that the set of all hyperbolic transformations forms a group under 
the operation of composition of functions. 


Section 6.3 
1. Determine d (- si, 0) and d(0.9, 0.99). 


2. Determine the positive number a for which d(0,a) = 2, and the negative 
number b for which d(0, b) = 10. 
3. Determined d (-3. si). 
4. (a) Determine the hyperbolic midpoints of the d-line segments [0.1, 0.9] 
and [—0.1, 0.9]. 
(b) Determine the hyperbolic centre and radius of the circle C = 
{z: |z —0.5| = 0.4}. 
(c) Determine the equation of the hyperbolic perpendicular bisector @ of 
[0.1, 0.9]. 
5. Determine the Euclidean centre and radius of the hyperbolic circle 
C= {z:4 (4.2) = i]. 
6. LetO0 <a <b < 1, and let the d-line ¢ with equation x*+ y?—2cx+1 = 0, 
where c > 1, intersect the radius [0, 1) between A(a,0) and B(b, 0). 
(a) Prove that £ intersects AB at the point (c —Vc* —1, 0). 
Now assume that ¢ is the hyperbolic perpendicular bisector of AB. 
(b) Use the Reflection Lemma to prove that 


= l+ab 
~ atb- 


(c) Deduce from parts (a) and (b) that the hyperbolic midpoint of AB is 


1+ab—J/(1—a?)(1 — b?) 0 
a+b . , 


7. Let & be a d-line that is not a diameter of Z and whose endpoints on @ 
subtend an angle 0,0 < 6 < z, at the centre of Y. Prove that the hyperbolic 
length of the d-line segment from the origin to ¢ that is perpendicular to @ 


is tanh! (cot AG + )). 


Section 6.4 


1. ABCD is a d-quadrilateral. The d-lines through A and B and through D 
and C are parallel, with a common boundary point P on @; the d-lines 
through D and A and through C and B meet at a point E in J, as in the 
figure in the margin (in which A is at the origin). 

(a) Prove that it is impossible that ZEAB = ZEDC. 
Hint: Consider the Euclidean triangle AADP. 
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(b) Prove that it is impossible that ZEBA = ZECD. 
Hint: Consider the Euclidean triangle APBC, with B at the centre 
of J. 
(c) Prove that the following combination is impossible: 


ZEAB=ZECD and ZEBA= ZEDC. 
(d) Prove that the following combination is impossible: 
ZEAB= ZEBA and ZEDC = ZECD. 


2. Let €; and €2 be a pair of perpendicular diameters of Y. Sketch €; and 
£5, and some of the ultra-parallels to €; for which @2 is their common 
perpendicular with €,. 

3. ABCD is a d-quadrilateral such that the hyperbolic length of each side is 1. 
The d-line segments AC and BD are of equal hyperbolic length and inter- 
sect at right angles at the origin O, their common midpoint, and OE is an 
altitude of the d-triangle AOAB. Also, P is the hyperbolic midpoint of OE. 

Determine the hyperbolic length of OE, the Euclidean length of OP, the 
angle ZABC, the hyperbolic length of OB, the Euclidean length of BD, and 
the angle g between PE and either d-line through P that is parallel to the 
d-line through A and B. 

4. Let AABC be a d-triangle with ZABC = 7/3, and let the sides BA and BC 
have the same hyperbolic length c. Let BD be the altitude from B to the 
side AC. Show that the hyperbolic length b of AC is greater than c. 

Hint: Apply the Sine Formula to triangle ABDA. 
5. Show that if AABC is a d-triangle right-angled at C, then 


sinh 2a 
sinh 2c’ 
where a and c are the hyperbolic lengths of the sides BC and AB. 


Hint: If tang =f, then sin? ¢ = a. 
6. Determine the hyperbolic lengths of the sides of the d-triangle with angles 
a 
2°6 


sin A = 


a 
and a 


Section 6.5 
1. Determine the hyperbolic area of the d-triangle AABC with vertices at the 
points A(0,0), B (4.0) and C (0. s). 
2. Let A, B, C and D be the points (0,0), (1,0), (0,1) and (—1, 0), respectively, 
and let @ be the d-line BC. 
(a) Determine the vertices of the d-triangle that is the image of the 
d-triangle AABC under hyperbolic reflection in @. 
(b) Determine the vertices of the d-triangle that is the image of the 
d-triangle AACD under hyperbolic reflection in @. 
(c) Sketch the d-triangles AABC and AACD, and their images under 
hyperbolic reflection in @. 


ABCD is a hyperbolic 
square of side-length 1. 


Proof of the Sine Formula 
(Subsection 6.4.3, 
Theorem 10). 
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(d) Determine the areas of the d-triangles AABC and AACD, and of their 
images under hyperbolic reflection in @. 
Hint: You may assume that Theorem 4 of Subsection 6.5.1 holds for 
asymptotic triangles, with K = 1. 
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Introduction 


1. 


The (Euclidean) Parallel Postulate Given any line @ and a point P not 
on €, there is a unique line m in the same plane as P and £ which passes 
through P and does not meet ¢. 


. The (Elliptic) Parallel Postulate Given any line @ and a point P not on @, 


all lines through P meet @. (That is, there are no lines through P that are 
parallel to @.) 


. The (Hyperbolic) Parallel Postulate Given any line @ and a point P not 


on £, there are at least two lines m through P that do not meet ¢. (That is, 
there are at least two lines through P that are parallel to €.) 


Section 6.1: Hyperbolic Geometry: the Disc Model 


1. In (Poincaré’s model of) hyperbolic geometry, points (or d-points, where 


‘d’ stands for ‘disc’) consists of the points in the unit disc ZY = 
{z: [zl <1} = (@,y) sx? + y? < I}. 

The points on the boundary @ = {z: |z| = 1} ={(, y):x7+y=1} 
of F are not points that belong to the geometry. 


. A d-line (or line in hyperbolic geometry) is that part of a (Euclidean) 


generalized circle which meets @ at right angles and which lies in %. 
A d-line may be an arc of a (Euclidean) circle or a Euclidean line 
segment that is a diameter of Y. The points where the correspond- 
ing generalised circle meets @ are called the boundary points of the 
d-line. 

d-lines that are arcs of Euclidean circles all lie in one-half of the disc 
J, and all curve away from the diameter bounding the half-disc. 

The equation of a d-line @ is of the form ax + by = 0, where a and b 
are not both zero, or x* + y? + fc + gy + 1 =0, where f? + g? > 4. 


. Two d-lines that do not meet in Y are parallel if the generalized Euclidean 


circles of which they are parts meet at a point on @, or ultra-parallel 
if the generalized Euclidean circles of which they are parts do not meet 
on @. 

Corresponding to any given d-line @ and any point P in F that is not 
on £, there exist exactly two d-lines through P that are parallel to £, and 
infinitely many d-lines through P that are ultra-parallel to @. 


. Let £ be a d-line that is part of a Euclidean circle C. Then inversion in C 


maps @ onto @, and J onto J. 
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5. A hyperbolic reflection in a d-line £ is the restriction to J of the inversion 
in the generalized circle C of which ¢ is part. 

The centre of such a circle C is the point of intersection of the tangents 
to @ where @ and C intersect. 

6. Hyperbolic transformations are the composites of a finite number of 
hyperbolic reflections, and form a group called the hyperbolic group, 
Gg, under the operation of composition of functions. 

Hyperbolic geometry consists of the unit disc Y together with the 
group Gg of hyperbolic transformations. 

7. The (hyperbolic) angle between two curves through a given point A in 
@F is the Euclidean angle between their (Euclidean) tangents at A. 

8. Hyperbolic transformations map d-lines onto d-lines, and preserve the 
magnitudes of angles. 

9. Origin Lemma Let A be a point of J other than the origin O. Then there 
exists a d-line @ such that hyperbolic reflection in € maps A to O. 

10. Let A be a point of Y. Then there exist infinitely many d-lines through A. 

Let A and B be two distinct points of Z. Then there exists a unique 
d-line £ through A and B. 

11. Let A; and Az be any two points of Y, and let 2; and £2 be d-lines through 
A, and Ag, respectively. Then there is a hyperbolic transformation which 
maps A, to Az and €, to é2. 

12. Let A, B € F be inverse points with respect to the d-line @, and let A’, B’ 
and ¢’ be the images of A, B and ¢ under hyperbolic reflection in another 
d-line €*. Then A’ and B’ are inverse points with respect to inversion 
in €’. 


Section 6.2: Hyperbolic Transformations 
1. Hyperbolic reflection p in the d-line @ which is part of a Euclidean circle C 


with centre a is given by the formula 


z—1l 
—,zEQJ. 
=a 


p(zZ) = 


Hyperbolic reflection is self-inverse. 

Hyperbolic reflection o in the diameter of ZY on which y = x tané@ is 
given by the formula o(z) = a7, z € Z, where a = cos@ +i sing. 

2. The composite of the cee reflections p(z) = & ead and o(z) = pe 


—1)z: 


3. The restriction to Y of every Mobius transformation of the form M(z) = 
ae > with |b| < |a|, is a composite of two hyperbolic reflections, and is 
therefore a hyperbolic transformation. 

4. Every hyperbolic transformation can be expressed as a composite of at most 
three hyperbolic reflections. 

5. The composition of an even number of hyperbolic reflections does not 


change the orientation of the unit disc, and is called a direct hyperbolic 
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transformation. A direct hyperbolic transformation can be expressed as a 
composite of at most two hyperbolic reflections. 

Any direct hyperbolic transformation is given by z+» M(z) = 
QJ, for some a, b with |b] < |a|. 

The composition of an odd number of hyperbolic reflections changes 
the orientation of the unit disc, and is called an indirect hyperbolic 
transformation. 

Any indirect hyperbolic transformation is given by z +» M(z), where 
M(z) = ie z € Y, for some a,b with |b| < |al. 

6. The composite of hyperbolic reflections in d-lines €; and 2 that intersect 
at a point in J is called a (hyperbolic) rotation and has exactly one fixed 
point in J. 

The composite of hyperbolic reflections in d-lines €; and £2 that do not 
meet in Y but do meet at a point on @ is called a (hyperbolic) limit 
rotation, and has no fixed points in Z but leaves one point of @ fixed. 

The composite of hyperbolic reflections in d-lines €; and £2 that do not 
meet in J or on @ is called a (hyperbolic) translation, and has no fixed 
points in F or on @. 

The composite of two hyperbolic translations may depend on the order 
in which they are applied. 

7. A direct hyperbolic transformation M can be written in the (canonical) form 
M(z) = K {53,, where K and m are complex numbers with |K| = 1 and 
me J. 

Conversely, every direct hyperbolic transformation M that maps a point 
m of & to the origin is of this form. 

A direct hyperbolic transformation M that maps the diameter (—1, 1) 
onto itself is of the form M(z) = ae where m € (—1, 1). 

An indirect hyperbolic transformation M can be written in the (canoni- 
cal) form zt K a where |K| = 1 andm e @&. 

8. Strategy To determine the general form of the direct hyperbolic transfor- 
mation that maps one point p in Y to another point g in F: 

1. write down the general form of the direct hyperbolic transformation M, 
that maps p to 0, and a matrix A, associated with M1; 
2. write down a direct hyperbolic transformation M2 that maps gq to 0, and 

a matrix A» associated with Mo; 

3. form the matrix product A; "A, associated with the direct transfor- 
mation M, ‘om 1, and hence write down the general form of the 
transformation. 


az+b 
bz+a’ 


ZE 


Section 6.3: Distance in Hyperbolic Geometry 


1. Properties of a Distance Function d 
1. d(z1,2Z2) = O for all z; and z2; 
d(z1,22) = O if and only if z} = zo. 
2. d(z1,22) = d(z2, 21) for all z; and zp. 
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3. d(z1, 73) + d(z3,22) = d(z1,22) for all z}, zg and z3. (Triangle 
Inequality) 

4. d(z1, 73) + d(z3, 22) = d(Z1, 22) if and only if z;, z2 and z3 lie in this 
order on a line. 

Additional Properties of the Distance Function d in Hyperbolic 

Geometry 

5. d(z1,z2) = d(Z, Z2) for all z; and z2 in J. 

6. d(z1,z2) = d(M(z1), M(z2)) for all z; and zo in Y and all direct 
hyperbolic transformations M in Gg. 

2. The hyperbolic distance d(0,z) between the origin and any point z € 
QZ is given by d(0,z) = tanh! (|z|) = 5 log, (44). Also, |z| = 
(e24 0.2) _ 1) / (e242) 649, 

In general, the hyperbolic distance between two arbitrary points z; and 


£2721 


zo in Dis given by d(z1,z2) = tanh”! ( are 

3. For small z, d(0,z) ~ |z|; as |z| — 1, d(0,z) — +00; for z # 
0, d(0,z) > |zI. 

4. A point m in Z is the hyperbolic midpoint of the hyperbolic line segment 
joining a and b in @ if m lies on this segment and d(a,m) = d(m,b) = 
5d(a, b). 

5. Strategy To find the hyperbolic midpoint of a hyperbolic line segment 
joining two points p, g on a diameter of Y, where |p| > |q|: 

1. calculate d(0, p) and d(0,q); 

2. if p and q lie on opposite sides of O, then calculate d = 
5(d(0, p) — d(0,q)); otherwise calculate d = 5(d(0, p)+d(0,q)); 

3. then the hyperbolic midpoint is the point m on the radius through O and 
p at a Euclidean distance tanh(d) from O. 

6. The hyperbolic circle with hyperbolic radius r and hyperbolic centre c 
is the set {z: d(c,z) =r, z€ J}. 

7. Every hyperbolic circle in Y is a Euclidean circle, and vice versa. 
Strategy To find the Euclidean centre and radius of a hyperbolic circle C 
with hyperbolic centre m: 

1. find the points a,b where Om meets C; 

2. the Euclidean centre of C is the Euclidean mid-point of ab; 

3. the Euclidean radius of C is sla — bj. 

Strategy To find the hyperbolic centre and radius of a Euclidean circle K 
with Euclidean centre p: 

1. find the points a,b where Op meets K; 

2. the hyperbolic centre of K is the hyperbolic mid-point of ab; 

3. the hyperbolic radius of K is 5d (a,b). 

8. Let A and A’ be points in the unit disc Y that are images of each other under 
reflection in a d-line €. Then @ is the (hyperbolic) perpendicular bisector of 
the hyperbolic line segment AA’. 

9. Reflection Lemma Let p and g be points of Y. If |p| # |q|, then 
the hyperbolic reflection that maps p and q onto each other is given by 
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M(z) = (az — 1)/@ — @), where 
a=(p—q+pq(P-])/ (PP - 99). 


The d-line in which this reflection takes place has equation 


x? 4+ y? — 2ax — 2by + 1 = 0, where w = a + ib. 


If |p| = |q|, then the reflection that maps p and gq onto each other is 
reflection in the diameter of J that bisects the angle ZpOq. 


Section 6.4: Geometrical theorems 


1. 


A d-triangle consists of three points in the unit disc F that do not lie on a 
single d-line, together with the segments of the three d-lines joining them. 


. The sum of the angles of a d-triangle is less than z. 


Each external angle of a d-triangle is greater than the sum of the 
opposite two internal angles. 

Small triangles have angle sums close to (but less than) 7r, and triangles 
with large areas have angle sums close to zero. 


. A d-quadrilateral ABCD consists of four points A, B, C, D in F (no 


three of which lie on a single d-line), together with the segments AB, BC, 
CD and DA of the four d-lines joining them. We also require that no two 
of these segments meet except at one of the points A, B, C or D. 

The sum of the angles of a d-quadrilateral is less than 277. 


. Inad-triangle AABC the angles ZABC and ZACB are equal if and only if 


the sides AB and AC are of equal (hyperbolic) length. 


. Two figures in the unit disc Y are d-congruent if there is a hyperbolic 


transformation that maps one onto the other. 
Similar d-triangles (that is, d-triangles with corresponding angles equal) 
are d-congruent. 


. An asymptotic triangle is a d-triangle, except that one or more of its 


vertices lie on @ rather than in Z. Simply asymptotic, doubly asymptotic 
and trebly asymptotic triangles have 1, 2 and 3 vertices on @, respectively. 

The angle sum of an asymptotic triangle is less than 2. The angle sum 
of a trebly asymptotic triangle is zero. 


. Let ¢ and €’ be two d-lines, and suppose that there exist points A on € and 


A’ on £’ such that the d-line segment AA’ meets £ and ¢’ at right angles. 
Then AA’ is a common perpendicular to ¢ and ¢’. 

Common Perpendiculars Theorem Two d-lines have a common 
perpendicular if and only if they are ultra-parallel. This common perpen- 
dicular is unique. 


. Through any point P of F not on a d-line £, there is a unique d-line that 


meets ¢ at right angles. 

Let @ be a d-line which passes through one vertex A of a d-triangle 
AABC and which is perpendicular to the side BC at the point D. The 
d-line segment AD is an altitude of the triangle AABC. 

Altitude Theorem Let the sides AB and AC of a d-triangle AABC be of 
equal hyperbolic length, and let the angle at A be 6. Then the hyperbolic 
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length of the altitude through A of the triangle is less than some number 
that depends only on 6. 

9. Pythagoras’ Theorem Let AABC be a d-triangle in which the angle at C 
is aright angle. Ifa, b and c are the hyperbolic lengths of BC, CA and AB, 
then cosh 2c = cosh 2a x cosh 2b. 

10. Lobachevskii’s Formula Let the d-triangle AABC have a right angle at 
C, and let the hyperbolic lengths of AC and BC be b and a, respectively. 
Then tan A = tanh 2a /sinh 2b. 

11. Angle of Parallelism Let @ be a d-line, and P a point of F that does 
not lie on £. Then the angle g between the perpendicular from P to ¢ (of 
hyperbolic length p) and either d-line through P that is parallel to ¢ is 
given by tang = 1/sinh 2p. 

12. Sine Formula Let AABC be a d-triangle right-angled at C. Let a and 
c be the hyperbolic lengths of BC and AB, respectively. Then sinA = 
sinh 2a / sinh 2c. 

13. Let ¢ be a d-line that ends at points A and B on @, and let ¢’ be that part 
in Y of a (Euclidean) circle through A and B, and let a be the angle 
(in Z) between @ and £’. Then all points on ¢’ are the same distance 
5 log, {tan($ + 7)} from é. 

14. For x € (—1,1), the following are all equal to tanh! x: 5 log, ie 


1-x’ 
2x 1 14x? 
1—x2’ 1—x2° 


1 tpl 1 = 
5 sinh 5 cosh 


Section 6.5: Area 


1. Area in hyperbolic geometry has the following properties: 

e the area of a d-triangle is non-negative, and is zero only if its Euclidean 
area 1s zero; 

e d-congruent d-triangles (and d-congruent asymptotic d-triangles) have 
the same area; 

e if one d-triangle can be fitted inside another, it has a smaller area; 

e area is additive. 

2. All trebly asymptotic d-triangles are d-congruent to each other. 

3. Let AABC be a d-triangle. Then there exists a trebly asymptotic d-triangle 
ADFEF that contains AABC. 

4. All trebly asymptotic d-triangles have the same (hyperbolic) area, and this 
is a finite number. 

5. The area of a d-triangle with angles a, 6, y is given by 
K(a — (a+ 8+y)), where K is the same constant for all d-triangles. It 
is usual to take K = 1, so that the area of a d-triangle is its angular defect. 

Let a, 6, y be positive real numbers withO <a+6+y < 2. Then 
there is a d-triangle in Y with angles a, 6, y. 

6. Let P be a point on a d-line @ of Y, and let 0 < B < oo Then there are 
exactly two d-lines through P that make an acute angle f with @, and each 
is a (hyperbolic) reflection of the other in @. 
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7. Tessellate means ‘fit together exactly a number of identical shapes, leaving 
no spaces’. 

Tessellations of the Euclidean plane using a single figure are sometimes 
called wallpaper patterns. There are only 17 different wallpaper patterns, 
in the sense that the symmetry group of any wallpaper pattern is isomorphic 
to one of only 17 non-isomorphic groups. 

8. A kaleidoscope is a machine that produces wallpaper patterns using 
reflections in mirrors. 

Successive reflections of a point in two plane mirrors that meet at an 
angle z / k give 2k points in all. 

Successive (hyperbolic) reflections of a point in two mirrors along d- 
lines in Z that meet at an angle z 7 k give 2k points in all. We may obtain 
a hyperbolic tessellation of Z (with d-congruent d-triangles) by succes- 
sive hyperbolic reflections starting from a d-triangle in Z formed by three 
mirrors along d-lines at angles of z / 7 (at the origin), 7 / 2 and x / o 


Section 6.6: Hyperbolic Geometry: the Half-Plane Model 


1. In the half-plane model, the set of points is the upper half-plane H = 
{z: z€C,Imz > 0}; h-lines are the restriction to H of vertical lines in 
the plane together with the restriction to H of circles with centre on the 
real axis in C; and the hyperbolic transformations are the mappings of H 
to itself given by z +> (az+ b)/ (cz +d)orzh (-az+ b) [(-cz +d), 
where a, b, c, d are real and ad — bc > 0. 


Elliptic Geometry: the Spherical. Model 


In the introduction to Chapter 6, we saw that in Elliptic Geometry all the postu- We usually take the sphere 
lates of Euclidean Geometry hold except that the Euclidean Parallel Postulate to have radius 1. 
is replaced by the following. 


The (Elliptic) Parallel Postulate Given any line @ and a point P not on 
£, all lines through P meet @. (That is, there are no lines through P that are 
parallel to £.) 


In this chapter we study the geometry of the sphere (which we call S$”), such 
as the surface of the Earth, as a model of elliptic geometry. s° 


A ‘line’ joining two points P and Q on the surface of a sphere is the curve 
of shortest length (that is, a geodesic) on the sphere between the points: it is 
actually the great circle (that is, the intersection of the plane through the centre great circle 
O of the sphere with the surface of the sphere) through the two points. Any 
two distinct great circles meet in two (diametrically opposite) points, so any 
two lines in spherical geometry meet in two points. 
The group of transformations in spherical geometry is the group of isome- 
tries of the sphere. Clearly a rotation of the sphere is an isometry, and we shall 


see that the group of isometries is composed of rotations of the sphere together s° 
with reflections in great circles. It turns out that any orientation-preserving 
isometry of the sphere is in fact a single rotation. Recall that an isometry is 


In Section 7.1, we establish a system of coordinates for the sphere; these 4 one-one mapping that 
are essentially the familiar coordinates of latitude and longitude. Then, in Preserves distances. 
Section 7.2, we obtain matrix representations for the isometries of the sphere: 
every isometry can be written as a 3 x 3 matrix. 

Next, in Section 7.3, we define a triangle on the sphere, find a formula 
expressing its area in terms of the sum of its angles, and show that every spher- 
ical triangle has a dual triangle whose sides and angles have the magnitudes 
of the angles and sides, respectively, of the original triangle. 

There is an attractive formula for the distance between two points on the 
sphere in terms of their coordinates. There is also an attractive generalization 
of Pythagoras’ Theorem in spherical geometry, and in proving it we are led 
to the trigonometric formulas connecting the sides and angles of a spherical 
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Spherical Space 


triangle. These establish that of the six measurements that describe a spheri- 
cal triangle (three side lengths, three angles) any set of three determines the 
other three. This is not the case in Euclidean geometry (where similar, non- 
congruent figures exist), but it is also true in hyperbolic geometry. Indeed, there 
are closer analogies between spherical geometry and hyperbolic geometry than 
there are between either geometry and Euclidean geometry. 

Many aspects of spherical geometry may be studied more easily if we first 
map the sphere onto a plane. This can be done in many ways, although, none 
of the mappings (or projections) is an isometry of the sphere onto the plane. In 
Section 7.4, we revisit stereographic projection, which maps the sphere onto 
the plane in a way that preserves angles and generalized circles. We also show 
how to obtain the group of isometries of the stereographic image of the sphere 
on the plane; and revisit coaxal circles. Finally, in Section 7.5, we look briefly 
at the problem of mapping the spherical Earth onto a flat map. 


7.1. Spherical Space 


7.1.1. Spherical Geometry 


Spherical geometry is the study of geometry on the surface of a sphere. It is 
simplest to choose this sphere to have unit radius; we therefore concentrate on 
what we call the unit sphere in Euclidean 3-dimensional space R?, namely the 
sphere whose centre is at the origin and which has radius 1. In terms of the 
usual x-, y-, z-Cartesian coordinates in R? with origin O at the centre of the 
sphere, the unit sphere has the equation x* + y* + z? = 1. We shall denote it 
by the symbol S?. 

The positive x-, y-, z-axes meet the sphere S* at the points A(1,0,0), 
BO, 1, 0), C(O, 0, 1), respectively. For obvious reasons the point C is 
sometimes called the North Pole of the sphere, and denoted by N. Every 
point (x, y, z) on the sphere S? has a diametrically opposite or antipodal point 
(—x,—y, —z); for example, the antipodal point to the North Pole is the South 
Pole S(0,0,—1). The plane through O perpendicular to the line ON cuts the 
sphere in a circle called the equator. 


Example1 Prove that any plane that cuts the sphere S* in more than one point 
cuts it in a circle. 


Solution Let OT be a radius of the sphere perpendicular to the given plane. 
A rotation of the sphere about this axis maps the plane to itself, and the sphere 
to itself; so it maps their intersection to itself. Each point on the intersection 
traces out a circle (of the same radius and centre) under the rotation, so the 


intersection is a circle. 


425 


It is usually straight- 
forward to modify our 
results to the case of a 
sphere of arbitrary radius, 
R say. 


Antipodal points 
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The analogue in spherical geometry of a straight line in the plane is a curve 
on S called a great circle. 


Definition A great circle is the circle cut out on the sphere S* by a plane 
through the centre of the sphere. 


Note that, by Example 1, a great circle is indeed a circle. Moreover, it is a 
circle of maximal radius 1 on the sphere, which gives rise to the name ‘great’ 
circle. 


| | Taste VY > 
Definitions Any great circle through JN is called a circle of longitude. 
The part of the circle of longitude through A(1, 0,0) which has positive x- | 
coordinates we call the Greenwich Meridian. The curve of intersection of Ney, 
S? with a plane parallel to the equator is a circle called a circle of latitude. cn 
The curve of intersection of S? with a plane that is not a great circle is called S 
a little circle. Greenwich meridian 


The equator is a circle of latitude; circles of latitude other than the equator 
are little circles. 
We define distance in spherical geometry (on S7) in the natural way. 


Definitions Let P and Q be any two points on S’, and C a great circle 

through P and Q. Then either arc with endpoints P and Q is said to be a Thus ‘the line PQ’ is not 
line on S? joining P and Q, or a line PQ. The length of a given line on —unique. 

S? is defined to be the Euclidean length of the corresponding circular arc; 

the distance (or spherical distance) between two points P and Q on S? is 

the length of the shorter of the two lines PQ, and is often denoted simply 


as PQ. 
Let P, Q and R be any three points on S” that do not lie ona single great Thus there are several 
circle. Then a line PQ, a line QR and a line RP form a triangle APOR. triangles APQR. 


A property of great circles that we state without proof is that great circles 

are geodesics on S*; that is, curves of shortest length between any of their 

points. This is intuitively clear for the circles of longitude from the North Pole, 

for instance. Note that little circles are not geodesics on S*. S 
The distance between any two points on S? is a portion of a great circle 3 

through the points. This circle has radius |, and so has circumference 271 — the 

same as the angle that the great circle subtends at its center (the centre of the 

sphere). It follows, by proportion, that the distance between any two points on great circle 

S? is equal to the angle that they subtend at the centre of S?. 


Theorem 1 The distance between any two points P and Q on S? is equal 
to the angle that they subtend at the centre of S?. 


Spherical Space 


Thus, for example, the distance from the North Pole to any point on the 
equator is 2/2, and the distance from the North Pole to the South Pole is z. 

With the above definition, spherical distance possesses all the ‘natural’ prop- 
erties that a distance function should possess. (We listed these in Subsection 
6.3.1.) 

For example, it satisfies the Triangle Inequality; that is, if P,Q and R are 
any three points on S7, then the side lengths of the triangle APQR satisfy the 
inequality PO+ QR > PR. This can be proved in a similar way to the Triangle 
Inequality in hyperbolic geometry (Theorem 2, Subsection 6.3.3). 

Great circles in spherical geometry play many of the roles that lines play in 
Euclidean plane geometry. There is always a great circle joining any two points 
on S*, because there is always a plane passing through them and the origin. 
The plane and the corresponding great circle are unique unless the points are 
diametrically opposite, in which case there are infinitely many great circles 
joining the points. 

However, one way in which spherical geometry differs markedly from 
Euclidean plane geometry comes from observing that any two great circles 
meet not in one, but in two (diametrically opposite) points. They do so because 
the planes that define them have a common point, the origin, and therefore a 
common line in R?, which cuts S* in the given points. So although great cir- 
cles are analogous to the straight lines of the plane, the analogy breaks down 
almost at once. There are certainly no ‘parallel lines’ on the sphere, because 
any two great circles meet in two points. 

It is sometimes helpful to observe that corresponding to each point P on S” 
there is a unique great circle of points 2/2 distant from P, which plays the role 
of the equator with P as a ‘pole’. We shall call this great circle the great circle 
associated with the point P. Conversely, given any great circle there are two 
points of $* which play the role of its ‘poles’. 


Example2 Let P and Q be any two points on $*. Show that there is a rotation 
of S? that maps P to Q. 


Solution Consider the great circle through P and Q, and let T be one of its 
poles. The rotation of S? about the axis OT and through the angle ZP OQ (in 
the appropriate direction) sends P to Q, as required. 


Problem 1 Determine whether rotations of S* exist with the following 
properties. 


(a) (1,0,0) (0, —-1, 0) 
(b) (1,0,0) + (0,1,0) and (0, 1,0) + (0,0, 1) 
(c) (1,0,0) + (0,—1,0) and (0,0, 1) F (0, 1,0) 
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S 
P 
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‘p 
The antipodal points P and P' have 
a common equator 


a 
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7.1.2 Coordinates on $2 


We now set up a system of spherical polar coordinates on S*. We start by 
defining the direction along the equator from A(1,0,0) to B(O, 1,0) to be the 
positive direction on the equator. Then for any point P; on the equator let @ be 
the angle ZAO P; measured in the positive direction along the equator; then 
we assign the coordinates (cos ¢, sing, 0) to P}, where 0 < @ < 27. 

Next, let P be any point on S? (other than the North and South Poles) that 
does not lie on the equator. Let P; be the point on the equator that lies on the 
arc NS of the great circle through N and P. 

Then let ¢ be the angle 7A O P; measured in the positive direction along the 
equator (0 < @ < 277), and let 6 be the angle 7NOP (0 < 6 < x); we assign 
the coordinates (cos ¢ sin 8, sing sin, cos @) to P. 

For example, the point of S* with spherical polar coordinates 


Won ois Tt w\: : 1 V3 1 
cos + sin4, sin 4 sin+, cos is the point (5 ~=, +3): 
(cos 3 sin 3 SIN | t) Pp 2/2? 2/2? V2 


Problem 2 Determine the angles ¢ and 0 in the spherical polar repre- 
sentation (cos @ sin, sin @ sin 8, cos @) of the points (0, = 4) and 


(4 _ 2 +) 
via 14? 14)" 

In this way every point on S? other than the North and South Poles has been 
given spherical polar coordinates involving appropriate values of @ and @. At 
the North Pole we take 6 to be zero and let @ take any value in the interval 
[0, 277), and at the South Pole we take @ to be z and let ¢ take any value in the 
interval [0, 277); the non-uniqueness of coordinates at the Poles will cause us 
no problems. 


Definitions Let P be a point on S* with spherical polar coordinates 
(cos @ sin@, sing sin@,cos@), where 0 < @ < 27m andO < 6 < zw. The 
angle @ of P is called its colatitude; the quantity 6’ = + — @ is called its 
latitude (note that —5 < 6’ < 4). The angle ¢ of P is called its longitude. 


Notice, for example, that the distance of a point P with colatitude @ on S* 
from the North Pole is simply 0. 


Problem 3 Determine the latitude, colatitude and longitude of the 
ad we A 2 
point (+ Wo 4) on S*. 


Hint: You met this point just before Problem 2 above. 


Problem 4 Determine the spherical polar coordinates of the point 
45° W of the Greenwich meridian on S? and 30° S. 


Notice also that the set of points on the sphere with a common colatitude 
@ forms a circle, Cg say; this is a little circle if 9 A m/2, and a great circle 
(the equator) if 9 = 2/2. Since 6 measures the distance of each point of the 
circle Cg from the North Pole N, we shall say that Cg is a circle on the sphere 
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a] 


nV 


Equatorial plane of Ss 


P has spherical coordinates 
(cos @ sin 8, sin @, sin 0, cos 0). 


Sometimes we shall find it 
convenient to regard @ as 
being defined on R rather 
than on [0, 277), as a 
periodic function with 
period 27. 


Note that 30° S means 
“30° below the equatorial 
plane’. Also, sometimes 
we take the phrase ‘of the 
Greenwich meridian’ as 
understood when 
describing longitude. 


Spherical Transformations 


with centre N. Since any point on S? may be chosen as the North Pole (with 
a suitable choice of coordinate axes), it follows that a circle on $ 2\ defined as 
the set of points equidistant from a given point, is either a great circle or a little 
circle. 


Problem 5 Show that, given any two great circles on S*, there are two 
great circles bisecting the angles between them, each at right angles to 
the other. 

Hint: The angle between two curves on S? at one of their points of 
intersection is the angle between their tangents at that point. 


7.2. Spherical Transformations 


7.2.1 Isometries of S? 


An isometry of S? is a mapping of the unit sphere to itself that preserves 
distances between points. In fact the isometries of a sphere form a group, 
since 


e the composition of two isometries is also an isometry; 

e the identity mapping is an isometry, and it is the identity element of the 
group; 

e the inverse of any isometry is an isometry; 

e the composition of isometries is associative, since the composition of 
functions is always associative. 


We have therefore established the following theorem. 


Theorem 1 The isometries of S* form a group. 


We call this group the group of spherical isometries, and denote it by the 
symbol $(2). 

We shall see at the end of this section that the rotations of S* form a group, 
which is identical to the group of direct isometries of the sphere. A reflection 
of S* in a plane through the centre of the unit sphere is not a direct isometry 
of S*, but every composition of an even number of such reflections is a direct 
isometry of S?. 

We now consider the three elementary rotations of S? (that is, its rotations 
about the coordinates axes) and their composites. 

The elementary rotation about OZ leaves the z-axis fixed, and rotates the 
(x, y)-plane. The positive direction of this rotation is anticlockwise as seen 
from above (that is, as viewed from a point of the positive z-axis), from the pos- 
itive x-axis towards the positive y-axis. So a rotation of S* about OZ through 
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tangent 


tangent 


A transformation is direct 
if it preserves the 
orientation of angles, and 
indirect if it reverses them. 
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an angle y, which we denote by R(Z, v), is given by 


R(Z,y): S* > S? 


xb> Ax 
x cosy —siny 0 
where x = | y | andAisthe matrix | siny cosy 0 
zZ 0 0 1 


Next, the elementary rotation about OX leaves the x-axis fixed, and rotates 
the (y, z)-plane. The positive direction of this rotation is taken to be anticlock- 
wise as seen from above (that is, as viewed from a point of the positive x-axis), 
from the positive y-axis towards the positive z-axis. So a rotation of S* about 
OX through an angle a, which we denote by R(X, q@), is given by 


R(x, a): S? > S? 


xh Ax 
x 1 0 0 
where x = | y | and A isthe matrix | 0 cosa —sina 
Zz QO sine cosa 


Finally, the elementary rotation about OY leaves the y-axis fixed, and 
rotates the (x, z)-plane. The positive direction of this rotation is taken to be 
anticlockwise as seen from above (that is, as viewed from a point of the posi- 
tive y-axis), from the positive z-axis towards the positive x-axis. So a rotation 
of S* about OY through an angle 6, which we denote by R(Y, B), is given by 


RY, p28 = 8? 


xh» Ax 
x cosB 0O- sinB 
where x = | y | and A is the matrix 0 1 0 
Zz —sinB O cosp 


It is easy to verify that the determinants of the elementary rotations R (X,@), 
R(Y,) and R (Z, y) are all equal to 1. Since any rotation can be expressed 
as a finite composition of these rotations, it follows that the determinant of the 
matrix of any rotation of S* also has determinant 1. 


Problem 1 Determine the images of the points (1,0,0), (0, 1,0), 


(0, 0, 1) and (sy are +) under the transformation R Os z). 


Notice that the simple cases of these rotations when a,8 or y = 
correspond to rotations of one axis onto another. For example, 


m/2 


cos 5 0) sin 5 0 0 1 
R(Y,5) = Oo 1t OU fal 0 Lt Os 
— sin > 0 cos 5 -1 0 0 


under this mapping, (1,0,0) Fh (0,0,—1),(0,1,0) te (0,1,0) and 


(0,0, 1) rH (1,0, 0). 
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We say that A is the 
matrix associated with (or 
of) the elementary rotation 
R(Z,y). 


Here A is the matrix 
associated with (or of) the 
elementary rotation 

R(X, a). 


Here A is the matrix 
associated with (or of) the 
elementary rotation 


R(y, B). 


We will prove this fact in 
Theorem 7, Subsection 
7.2.3. 
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Example 1 Show that R (X,3)-R(Y,3) AR(Y,5)- R(X, 4). This shows that 
multiplication of the 
Solution Here matrices of elementary 
rotations (and hence 
I 0 0 composition of 
R (x, uy) -R (Y, 5) =|0 cos 5 — sin z elementary rotations) is 
0 sin cos¥ not commutative. 


0 1 
= 0 Of], 
1 O 
and 
cos> 0 sind 
R(Y,%)- R(X, 3) = 0 1 0 
—sint 0 cost 
1 0 0 
x 10 cos 5 — sin 5 
0 sin 5 cos 5 
0 0 1 1 O 0 
= 0 1 0 00 -1 
-1 0 0 0 1 0 
0 1 0 
= 0 0 —-!I 
-1 0 0 


Thus R (X, a) -R(Y, 5) ARY, 5) - R(X, 5), as required. 


Problem 2 Calculate R(Z,y)- R(Y,B) and R(Y, B)- R(Z,y). 


Problem 3 For the elementary rotation R(X, a), verify that the trans- 
pose of its matrix is equal to the matrix of the elementary rotation 
R(X, —a). State and prove the corresponding results for the elementary 
rotations R(Y, 6) and R(Z, y). 


Notice that the inverse of an elementary rotation through a given angle is 
an elementary rotation through the same angle but in the opposite direction; 
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that is, 

R(X,a)~! = R(X, —a), 
R(Y, B)' = R(Y,—8) 
R(Z,y)7' = R(Z,—y). 


and 


Combining this observation with the result of Problem 3, we obtain the 
agreeable consequence that the inverse of a matrix representing an elemen- 
tary rotation is equal to the transpose of that matrix. For example the matrix A 
of the elementary rotation R(Y, B) is 


cosB 0 sin 
A= 0 1 0 ; 
—sinB 0 cosp 


then A~! = AT, 

We saw earlier that we can rotate S* in such a way as to map any particular 
point onto any other. We now see how we can determine the matrix of such a 
rotation explicitly. 

First, we see how to rotate S* so as to send the point A(1,0,0) to any point 
P(cos@¢ sin@,sin@ sin 6, cos @) of S*. First we rotate S? to send A to the point 
Pi (cos ¢, sin d, 0); then we rotate S? to send P; to P. 

The first rotation is the elementary rotation R(Z,@); but what is the sec- 
ond rotation? It is a rotation of S? ‘towards the North Pole’ through an angle 
6’ = > — 8, so it is something like R(Y,—6') or R (Y,0 - 5). But it is not 
a rotation about the y-axis; instead, it is a rotation of S* about the direction 
where the first rotation sent the y-axis. We may perform such a rotation in sev- 
eral stages: rotate the sphere first by the reverse transformation R(Z, —@), then 
by the elementary rotation R (Y O- 5), and then finally by the transformation 
R(Z,®@). 


Thus the second rotation is the composition mapping 
R(Z,$)R(Y, —0')R(Z, —$) = R(Z,b) RY, —0')R(Z, b) |. 
It follows that the final transformation that sends A to P is given by 


(RZ, g) RY, -6')R(Z,b)-") R(Z,g) = R(Z,g)R(Y, —6'). 


Theorem 2 A rotation of S? that maps A(1,0,0) to P(cos¢sin0, 
sing sin@,cos@) is given by the composition R(Z,#)R(Y,—6’), where 
gv =2_8@, 

2 


Problem 4 Determine the matrix A of a rotation x 1 Ax of S? 


ode wie, A i i 
that maps A(1,0,0) to P (; Ja? 2J2" 4). Verify your answer by direct 
calculation. 


Hint: You met this point P in Problem 3 of Subsection 7.1.2. 
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Recall that any square 
matrix A with the 
property that At=al 
or AA? = Tis an 
orthogonal matrix. 


Subsection 7.1.1, 
Example 2 


From A to Pi and from Pi 
to P by rotations 


Note that there are many 
such rotations. Any 
composition of this 
particular rotation with a 
subsequent rotation about 
OP has the required 
properties. 
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Example 2 Determine a rotation that maps the point P(cos@sin8@, 
sind sin 8, cos 0) of S? to N(0, 0, 1), in terms of elementary rotations. 


Solution By Theorem 2, one rotation that sends A to P is given by the 
composition R(Z, )R(Y, —6’), where 6’ = 4—6. It follows that the mapping 


(R(Z, @)R(Y,-6’)) | = RYY,-6') | R(Z, 9)! 
= R(Y,6')R(Z, —¢) 


sends P to A. 

Now at the North Pole ¢ = 0 and @ = 0, so that 6’ = 4. Hence one rota- 
tion that sends A to N is R (Y : —3). It follows that one rotation that sends 
PtoNis 


R (v.-5) o R(Y,0’) R(Z,—) 
=R(Y, o— 5) RZ, —$) 


= R(Y,—0) R(Z,—¢). 


Problem 5 Determine the matrix A of a rotation x +> Ax of S that 
maps P (5 Ase G a) to N. Verify your answer by direct calculation. 


Hint: You met this point P in Problem 4. 


Problem 6 Determine the matrix A of a rtanon x +} Ax of S? 
Lot a2 Sa. ee 
that maps Q ( 7 +3) to p(s on ae a fs). Verify your answer 


by direct calculation. 


7.2.2 Reflections in Great Circles 


Reflection in the equatorial plane of S? (that is, in the plane z = 0 that contains 
the equator of S”) is the mapping x +> Ax that sends the point x = (x, y, z) to 


10 0O 
the point (x,y,-z),soA=|0 1 0 |. Each point lies vertically above 
0 0 -l 


or below its image, unless it lies in the equatorial plane — in which case it 
coincides with its image. 


Problem 7 Evaluate the determinant of the above matrix A. 


Next, let 7 be any plane that passes through the centre O of the sphere S?. 
The equation of z must be of the form ax+by+cz = 0, since it passes through 
O; if we define the vectors a and x to be (a,b,c) and (x, y, z), respectively, 
then we can express the equation of z in the form 


a-x=0. 
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For 

R(Y, Bi) RY, Bo) 
= R(Y, Bi + Bo), 

for any angles f , Bo. 


equato! 


reflection of x in 
the plane x 
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This defines a plane with the vector a as a normal to it; without loss of 
generality, we shall assume that a,b and c have been chosen so that a has 


length 1. 
Let r be the reflection of S* in the plane zr; we shall find a matrix for this | We shall interchangeably 
mapping r of S? to itself. call this reflection in 1 


The image of a point x of S? is the point r(x). Now the line joining the 24 "¢flection in the great 


: : : -ircle in which 
points x and r(x) is a line parallel to the normal vector a, so we must have that en ts §2 


r(x) =x-+fa, for some value of the parameter f. For r(x) — x = fa, for 
some f. 
We can find this value of t by observing that since x lies on S* so does its 


image r(x), and so 
(x + ta)-(x+ ta) = 1. 
This implies that 
X-x+2tx-at+fa-a=l, 
so that For x- x = | and 
a-a=l1. 


tx-at?? =0. 


Now, if x -a = O, then the point x lies in the plane z in which we are 
reflecting, and then as you would expect t = 0 and r(x) = x. 

Ifx-a 0, then we may exclude the possibility that t = 0. It follows from For then clearly r(x) # x. 
the equation 2tx - a+ t2 = 0 that t = —2x- a, and so 


r(x) = x— 2(x-a)a. 


To represent this mapping via a matrix, we express this equation in terms of 
coordinates and group the terms carefully: 


r(x, y, Z) = (x, y,Z) a 2(ax + by + cz)(a, b,c) 
= ((1 2a”) x — 2aby — 2acz, —2abx + (1 = 2b”) y 


—2bez, —2acx — 2bcy + (1 — 2c?) z) ‘ 


so that it can be expressed in matrix form as 


x 1 — 2a? 2ab 2ac % 
y}rHl —-2ab 1-2b? —2be y 
z —2ac —2be 1—2c? Zz 


Theorem3 Reflection of S? in the plane x with equation ax+by+cz = 0, 


where a* + b* + c* = 1, is given by the mapping x +> Ax where 
1—2a* —2ab ONE 
A=|[ —-2ab 1-2b? —2bc 


—2ac he | = De” 
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Notice that in the particular case when the plane is the equatorial plane, with 
equation z = 0 (so that a = b = 0 and cc = 1), Theorem 3 gives the matrix for 
the mapping x > Ax to be 


that you met above. 


Problem 8 Determine the matrix A such that the mapping x + Ax 
represents reflection of S in the plane with equation 3x +4y—5z = 0. 


Remark 


There is another way to find the matrix representing a reflection in a great 
circle. Let the point P be one of the polar points of the plane a through O 
that meets S* in the given great circle. Let R be a rotation taking P to the 
North Pole N, and let r be reflection in the equatorial plane of S*. Then the 


oe : = : oan We omit the details of this 
composition mapping R~ ‘rR represents reflection of S* in z. 


calculation. 


We now establish a result relating products of reflections of S* to rotations 
of S?. It is the exact analogue of a result in Euclidean geometry. The result Subsection 5.2.1, margin 
holds also for general reflections and rotations of R*, so we prove it in that note alongside Problem 4 
more general setting. 


Theorem 4 The product of any two reflections of R? in planes through O 
that meet in a common line is a rotation about that common line. 


Proof By choosing coordinate axes suitably, we may arrange that the com- 
mon line is the z-axis. The planes are therefore vertical, and it follows that 
each reflection leaves the z-coordinate of each point unaltered. It is there- 
fore sufficient to look at the effect of each reflection and composite on planes 
perpendicular to the z-axis; we shall discuss the (x, y)-plane, but a similar 
argument applies to all planes parallel to this one. 

Let the first plane 77; make an angle a with the positive x-axis and the second 
plane z2 make an angle 6 with the positive x-axis. Let a point P in the (x, y)- 
plane have plane polar coordinates (r,@). Then the image of P under reflection 
in z is the point P’ with polar coordinates (r, 2a — 0). The image of P’ under 
the reflection in z2 is the point P” with polar coordinates (r,28 — (2a—6)) = 
(r,0+2(a— 6)). This is the image of P under rotation about the common line 
of 71 and 72, through twice the angle between the planes. | 


We deduce from this theorem that the composite of reflections in two great 
circles on S? is a rotation of S* about the line common to the planes defin- 
ing the great circles, and through twice the angle between them. We can also 
deduce that every rotation is a product of two reflections; these reflections are 
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in planes that have the axis of rotation as their common line, and are separated 
by half the angle of the rotation (but they are otherwise arbitrary). 


Example 3 Prove that the image of a circle C on S” under an isometry t of S* 
to itself is a circle. 


Solution Let the circle C be cut out by a plane z, and let @ be the line 
through the origin perpendicular to 7. Let £ meet S? at the points P and P’. 
Then all points of C are the same distance from P. 

Since ¢ is an isometry, the points of t(C) are all the same distance from t(P). 
It follows that they lie on a circle, which must be the circle cut out on S* by 


the plane f(z). 


We now establish the following important result. 


Theorem5 The Product Theorem 
Every isometry of S7 is a product of at most three reflections in great circles. 


Proof The key idea in this proof is the observation that an isometry of S? is 
known completely when its effect on three points (which do not lie ona common 
great circle) is known. We shall prove this observation in the course of proving 
the theorem. 


Let P, Q and R be three points not lying on the same great circle, and let 
P',Q' and R’ be their images under an isometry ft of S? to itself. Let rp be See left figure above. 
a reflection of S? in a great circle that sends P to P’. Consider the angle in See centre figure above. 
IR? with arms P’rp(Q) and P’Q’, and let C be the great circle that bisects 
this angle and the segment joining rp(Q) and Q’. Reflection in C fixes P’ and 
sends r,(Q) to Q’. Suppose that this reflection sends rp(R’) to R”. See right figure above. 
The distance of R” from P’ is the same as the distance RP (call it dp), and 
the distance of R” from Q’ is the same as the distance QR (call it dg), since 
both reflections are isometries. So the point R” lies on the intersection of the 
circle with centre P’ and radius dp and the circle with centre Q’ and radius 
dg; So too does the point R’. 
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Then it may be that R’ and R” coincide, in which case we have found a 
product of two reflections sending R to R’. Or it may be that R’ and R” do not 
coincide but a reflection in the great circle through P’ and Q’ maps R” to R’, 
and so three reflections have been used. 

We now claim that any isometry fixing three points on S? fixes every point 
of S?. Let P, Q and R be the points, let M be any other point on S?, and let 
t be the isometry fixing the points P, Q and R. Then ¢(M) lies on the circle 
with centre P and radius PM and on the circle with centre Q and radius QM. 
Of the two points satisfying this condition, only one also lies on the circle with 
centre R and radius RM, and that is the point M itself; so the image of M 
under ¢ is M, and therefore r fixes every point of S*. It follows that if two 
isometries f; and fz agree on three points, then they agree on every point, since 
the composite ft, | 6 t satisfies the conditions of our last remark. 

We deduce that the isometry mapping the points P, Q and R to the points 
P’,Q’ and R’ must be the product of the two or three reflections that we 
described above. This completes the proof. | 


Finally in this subsection we show that the Isosceles Triangle Theorem in 
spherical geometry can be established using a reflection in a great circle. 


Theorem 6 __Isosceles Triangle Theorem 
Let a triangle APOR on S? have sides PQ of length r, QR of length p, and 
RP of length q, and let p = q. Also, let the angles at the vertices P, Q and 
R bea, f and y, respectively. Then a = fp. 


Proof Consider the reflection in the great circle that is the internal bisector 
of the angle 7PRQ. 

It maps the great circle through R and P to the great circle through R and 
Q; and, since it is an isometry and p = q, it therefore maps P to Q, and Q 
to P. It maps the great circle through P and @ to itself, and so it interchanges 
the angles a and £, which shows that a = f. Oo 


7.2.3. Direct lsometries 


The Product Theorem allows us to deduce that every isometry is either a reflec- 
tion, a rotation or the composite of a rotation and a reflection. For if it is a 
composite of two reflections, then it is a rotation; and if it is a composite of 
three reflections, then the composite of the first two is a rotation. 

It is clear that every rotation of S* is an direct isometry of S*. 

We shall now see that every direct isometry of S? is a product of rotations. 
For each reflection reverses the orientation of angles, and so the product of 
two reflections preserves the orientation of angles, while the product of three 
reflections reverses it again. So a direct isometry must be the product of an 
even number of reflections and therefore be a rotation. 
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It does not follow that 
these reflections are 
uniquely determined — 
recall the case of the 
rotation discussed above. 


Theorem 5 


For the product of any pair 
of reflections is a rotation. 
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It is instructive to give a direct proof of this result that does not use the 
Product Theorem, because reflections are not easy to handle geometrically or 
algebraically. 

First, any direct isometry of S” that fixes the North Pole N must map each 
circle of latitude to itself, and so must be a rotation of that circle. Let D and E 
be points on different circles of latitude; they remain the same distance apart, 
so the rotation of their two circles of latitude must be through the same angle. It 
follows that every circle of latitude on S7 is rotated equally, and so the isometry 
fixing N must be a rotation about ON. 

Next, let M be a direct isometry that sends a point P to P’ and a point Q to 
Q’, and let R be arotation that sends P to P’. Then Mo R7! isan isometry that 
fixes P’. We may assume that P’ = N (if need be by additionally composing 
with a rotation that sends P’to N). Then the isometry M o R'= R(Z,y), by 
the remarks in the previous paragraph; so the direct isometry M = R(Z,y)oR 
is a product of two rotations. We deduce that every direct isometry is a product 
of rotations. 

We shall now see that every direct isometry of S$? is a single rotation. 

Since every 3 x 3 matrix with real entries (such as a product of rotations 
must be) must have a real eigenspace, then any such 3 x 3 matrix describing See Appendix 2. 
an isometry of S* to itself must leave the points where its eigenspace meets S* 
fixed. Since it is also a direct symmetry, it must therefore be a rotation about 
those points. It follows that every direct isometry of S* to itself is simply a 
rotation. 

We can deduce from this also the fact that the composite of two rotations of 
S? is also a rotation of S, since the composite is itself a direct isometry. 

We summarize these facts as follows. 


Theorem 7 


(a) Every isometry of S? is a reflection, a rotation, or the composite of a 
reflection and a rotation. 
(b) Every direct isometry of S* is a rotation, and vice versa. 


Problem 9 Prove that the composite of an even number of reflections 
of S? in great circles is a rotation of S*. 


7.3. Spherical Trigonometry 


7.3.1. Spherical Triangles 


Recall that a triangle on S”, or a spherical triangle, consists of the arcs of three 
great circles joining three points on S*. We first establish that spherical trian- 
gles exist with any given angles; more precisely, we establish the following 
result. 


Spherical Trigonometry 


Theorem 1 Given any three angles a, 6 and y wiha+fh+y > 7, there 
exists a spherical triangle with those angles. 


Proof We use the idea of stereographic projection, under which circles on 
S? correspond to generalized circles (that is, lines or circles) in the extended 
plane. 

By a rotation of S$”, if necessary, we may assume that one of the vertices of 
a triangle on the unit sphere is at the South Pole, S. 

Under stereographic projection a triangle on S* with one vertex at the South 
Pole S corresponds to a circular-arc triangle in the plane (that is, a ‘triangle’ 
in C whose sides are arcs of generalized circles) with one vertex at O, the 
centre of the unit circle @, and two straight sides. It follows that to prove 
the theorem it is enough to construct a triangle of this form in C that has the 
specified angles. This we now do; in our construction we assume that w and £ 
lie between 0 and 3, for simplicity. 

Consider the circular-are triangle AABP in C that has angles 5 at A, > at 
B, and 6 at P, where PA and PB are straight lines (so that P is the centre of the 
circle through A and B). Let D be the point on PB such that the angle 7BAD 
equals a, let C be the point on PA such that the angle ZABC equals f, let AD 
and BC meet at O, and let the angle 7AOB equal x (as shown). Our first task 
is to evaluate x. 

Both the angles ZAOC and ZBOD are equal to x — x. By considering the 
triangles AAOC and ABOD in turn, we deduce that 


ZPCO = (ma —x)+ (3 a) = “2 x—a, 
and 
ZPDO = (xn — x) + (% —B) =F -x-B. 


Then the sum of the angles of the quadrilateral PCOD is 
an = 0+ (3 -x-a) ++ (3-2-8), 
so that 
x=am+0-a-f. 
It follows that x = y if and only if 
@=a+Bt+y—-Z. 


Hence if we repeat the construction with the value a + 6 + y — z for 0, the 
circular-arc triangle AABO has the desired angles a, 6 and y. It follows from 
the earlier discussion that by stereographic projection we can then construct 
the required spherical triangle on S* with these angles. a 


A modification of the above argument gives the following important result, 
which we state without proof. 
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In fact we also require that 
a+ B+y <5 in view 
of Theorem 3 below — but 
we shall not go into this 
here. 


Theorem 7 of Subsection 
5.2.4 (in which 
stereographic projection 
was introduced). 


Recall that stereographic 
projection is 
angle-preserving 
(Theorem 8 of Subsection 
5.2.4). 


This is possible, since by 
hypothesis 
a+B+y>qr. 
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Theorem 2 The sum of the angles of any spherical triangle is greater 
than z. 


In view of Theorems | and 2, it is useful to assign a name to the difference 
between the sum of the angles of a spherical triangle and z: this measures how 
far the triangle departs from being Euclidean. 


Definition The angular excess of a spherical triangle is the difference 
between the sum of the angles of the triangle and z. 


Problem 1 Construct spherical triangles with angular excess of 
(a) 0.01 radians, (b) 3.14 radians. 


Recall that earlier when we defined a distance PQ between two given points — Subsection 7.1.1 
P and Q on S”, we noted that there were two ‘line segments’ PQ. Now, for 
definiteness, we call the shorter of these two the strict line segment PQ. (If 
both are of length 7, we choose either in some explicit way.) 

Similarly, when we defined a triangle APQR with three given vertices P, 0 
and R on S2, we noted that there were several such triangles. For definiteness, 
we now Call the triangle whose sides are strict line segments the strict triangle 
APQR. Then it is possible to rotate the sphere in such a way that the whole of 
the strict triangle can be seen at one time. (You should convince yourself that 
this is the case.) The inside of the strict triangle APQR is then defined to be 
the region of S* bounded by the three strict line segments. The outside is the 
complement of the inside. 

We now investigate the regions into which the corresponding great circles 
defining the strict triangle APQR divide S*. Notice first that the equatorial 
plane defining each of the relevant great circles cuts S* into two regions. A 
second great circle (and equatorial plane) divides each of these regions into 
two, giving four regions. Finally a third great circle (and equatorial plane) 
divides each of these regions into two, giving eight regions: so there are eight 
spherical triangles with given vertices P, Q and R on S?. 

Now let us look at the strict triangle APQR more closely. For definiteness, 
we will assume that the vertices P, Q and R are chosen in clockwise order as 
viewed by an observer at the centre of the sphere. Rotate the sphere to move 
R to the North Pole, and Q to some point in the (y, z)-plane. By our choice 
of the orientation of APQR and the fact that the line segments PR and PQ are 
strict line segments, it follows that this rotation leaves P on ‘the front’ of the 
sphere, where we can see it. 

The great circle through R and Q is the ‘rim’ of S? that we can see. So the 
great circle RP meets it again at R’, say, the diametrically opposite point to R; 
and the great circle QP meets it again at Q’, say, the diametrically opposite 2 
point to Q. Similarly the great circles PR and PQ meet again at P’, say, the YR 
diametrically opposite point to P. - 


Q 


© 


n 
fi 
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So, associated with the strict triangle APOR in this particular position on S” 
there are three great circles, each meeting in two diametrically opposite points; 
a total of six points in all. The great circles define eight triangles: 


e the strict triangle APOR and the triangles APQ’ R, APQ’R’ and APR’ OQ (all 
visible on the ‘front’ of S?), and 
e four triangles AP’QR, AP’Q'R, AP’Q'R’ and AP’ R’Q ‘round the back’. 


In fact, each of the eight triangles has a diametrically opposite triangle to 
which it is congruent (under the map (x, y, z)  (—x,—y, —z)). For example, 
the triangle APOQR is diametrically opposite to the triangle A P’Q’R’. 

In the figure in the margin we have marked the four ‘visible’ triangles 
APOR, APQO'R, APO'R' and APR’Q with Roman numerals J, II, III and 
IV, respectively. Similarly, we denote their diametrically opposite triangles 
AP‘Q'R’, AP'OR’, AP'OR and AP’RO’ with Roman numerals I’, II’, II’ 
and IV’, respectively. 

In order to label equal angles conveniently, let us denote by a@ the angle 
ZQPR in the strict triangle APQR, by f the angle 7PQR, and by y the angle 
ZQRP. A useful observation is that two great circles meet at the same angles 


at each of their points of intersection; so, for example, the angles ZQPR and 
ZQ' P’R’ are equal. 


Example 1 Show that the triangles APOR and AP’Q’R’ have their angles 
equal in pairs. 


Solution The angle ZQPR at P in APQR is formed by two great circles that 
meet again at P’, forming the angle 7 Q’ P’ R’ at P’ in AP’Q’R’, so they must 
be equal. 

A similar argument shows that the angles at vertices Q and Q’, and the 
angles at vertices R and R’, are also equal. 


Problem 2 Determine the angles in each of triangles II, III, and IV, in 
terms of a, 6 andy. 


Area of a Triangle 


We next find the area of a spherical triangle in terms of its angles. In this 
discussion, we introduce the term lune to denote a two-sided polygon defined 
by two great circles, choosing arbitrarily which of the four such regions on the 
sphere we mean. 

The area of a lune is clearly proportional to the corresponding angle between 
the two great circles, the angle of the lune. Now when the lune has angle 3, 
its area is 2 (for the lune is then one-quarter of S2, which has area 4c); so by 
proportion the area of a lune of angle jz, say, is 2. 


441 


RY 


A lune with vertices at NV 
and S, and angle jp. 
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The ‘front’ of S* as shown in the margin consists of two lunes with vertices 
R and R’, each formed of two strict triangles, I + IV and II + III (using the 
above notation for triangles). The lune I + IV has area 2y , since it has angle y. 
With a slight abuse of notation we express this fact about areas in the form 


I+IV =2y. 


A similar argument involving the lune with vertices Q and Q’ yields the 
equation 
I+I= 26, 


and then with the lune with vertices P and P’ yields the equation 
1+IIl’ = 2a. 


Since triangles III and III’ are congruent, we may rewrite this last equation 
as 
1+ Il = 2a. 


Adding the three equations, we obtain 
3-1+N+W+V=2(a+6+y). 
Also, the area of the front of S? is 
1+1+01+I1V = 2z. 


It follows that 
2-I14+27 =2(a+6+y) 
or 
l=(a+fBt+y)—-7Z. 


This completes the proof of the following theorem for triangles, in the 
special case that they are strict triangles. 


Theorem 3 The area of a spherical triangle is equal to its angular excess. 


Problem 3 Construct a spherical triangle of area — 


7.3.2 Dual Triangles 


The somewhat fiddly nature of the definition of a strict triangle pays off hand- 
somely in a feature of spherical geometry that was often exploited in the 
18th century by Euler and Lagrange, two of the pioneer figures in spherical 
geometry. 

We required that the sides of a strict triangle be less than or equal to z in 
length; for simplicity in what follows, we shall ignore the case of equality. 

Let A, B and C be the vertices of a strict triangle, with angles a, 6 and y at 
the vertices, respectively, and opposite sides of length a, b and c, respectively. 
Each side and angle of the triangle AABC has magnitude between 0 and z. 


The general case is proved 
by dividing an arbitrary 
triangle into strict 
triangles, and applying the 
special case to each; we 
omit the details. 
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We now construct a new spherical triangle AA’B’C’, called the dual of the 
original triangle AABC, with vertices A’, B’ and C’, as follows. Its angles at 
A’, B’ and C’ are w — a, — b and x —c, respectively; and its sides are A’ B’ 
of length x — y, B’C’ of length z — a, and C’A’ of length x — B. 

One way of constructing such a triangle is as follows. Extend the segments 
AB and AC to segments AD and AD’, each of length 4, and draw the great circle 
through D and D’ (this is the equator for A). Similarly, extend the segments 
BC and BA to segments BE and BE’, each of length 4 , and draw the great 
circle through E and E’ (this is the equator for B); and extend the segments 
CA and CB to segments CF and CF’, each of length 4, and draw the great 
circle through F and F’ (this is the equator for C). Then either of the strict 
triangles defined by the great circles DD’, EE’ and FF’ is a spherical triangle 
with the desired properties. 

For example, choose the one that overlaps AABC, and denote its vertex on 
EE’ and FF’ by A’, its vertex on FF’ and DD’ by B’, and its vertex on DD’ 
and EE’ by C’. We now check that the triangle AA’ B’C’ is indeed dual to the 
original triangle AABC. 

The lengths of CF’ and CB’ are both 5, since A’FF’ B’ is the equator for C. 
But B’ is on the equator B’DD’C' for A, so B'A = 3. Thus A and C are on 
the equator for B’, so that the great circle FAC D’ defined by A and C must be 
the equator for B’. This means that D’ lies on the equator for B’, so B’D! = 7. 
Similarly DC’ = 3. 

Now consider the equator B’DD’C’ for A, and denote by O the centre of 
the sphere. The angle DOD’ is equal to ZBAC = a (since both ZDOD’' and 
ZBAC are just the angle between the planes OAB and OAC); thus DD’ lies 
on a (great) circle of radius 1, and so the length of DD’ is also a (since DD’ is 
equal to ZDOD’). It follows that the length of B’C’ is 


B'C = B'D'+ DC'- DD =3+5-a=n-«a. 


Similarly the length of C’A’ is x — B, and the length of A’B’ is x — y. 
Next, by construction CF’ = 4 and BE = 4. It follows that the length of 
F'E is 
F'E = F'C+BE-BC=242%-a=n-a. 


Hence the angle 7F’OE equals z — a (since F’BCE is a circle of radius 1 
with centre O and hence 7F’OE equals the length of F’ E), so that the angle 
ZF'A'E also equals x — a (since both are just the angle between the planes 
OA’ F' and OA’ E). It follows that the angle 7B’ A’C’ equals z — a, as required. 
Similarly, 7A’B’/C’ = x — band ZB'C’A! =x —c. 

We have therefore established the following theorem. 


Theorem 4 Dual Triangles Theorem 

With every strict spherical triangle A there is associated another triangle 
A’, called its dual, whose angles are the complements of the sides of A and 
whose sides are the complements of the angles of A. 
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AA’B’'C' is dual to 
AABC. 


Here by the ‘complement’ 
of a quantity x we mean 
the quantity 7 — x. 
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Here is an example of how the theorem proves useful. Let AABC be a strict 
triangle with ZABC = ZACB. Then, if AA’B’C’ is its dual (which exists by 
Theorem 4), we have A’C’ = A’ B’; it then follows from the Isosceles Triangle 
Theorem that 7A’B’'C' = ZA’C’B’. It then follows by duality that AC = AB. 
This is the converse of the Isosceles Triangle Theorem, in the case of strict 
triangles. 


Problem 4 Let A,B and N be the points (1, 0, 0), (0, 1, 0) and 
(0, 0, 1) on S*. Prove that the triangle AABN is congruent to any of 
its duals. 


7.3.3 


In real life we are often interested in the (spherical) distance between points on 
the surface E of the Earth, whose radius is about 4000 miles. The comparable 
problem for S? is to find a formula for the distance between two points of S*. 
Then a scaling map 


Distance between Two Points 


t:S OE 
xh 4000x 


provides an answer to the original question concerning FE. 

Let P and Q be points on S* with Cartesian coordinates (p1, p2, p3) and 
(41,492,493), respectively, whose distance apart on S? is d. It follows that, if O 
denotes the centre of S*, the angle ZPOQ also equals d. 

Then the dot product of the vectors OP and 00 is 


> lc 
OP-OQ = piqi + p2g2 + p3g3 = 1+-1-cos d, 


so that cosd = piqi + p2qg2 + p3q3. 


Theorem 5 The distance d on S* between the points (pj, p2, p3) and 
(41, 92,43) is given by 


cos d = piqi + p2q2 + P34q,- 


In applications to E, we must remember that if a point has latitude 0’ and 
longitude @ then it has colatitude 6 = 5 — 6’ and Cartesian coordinates 
(cos @ sind, sing sin @, cos @). 


The latitudes and longitudes of various cities are approximately as follows. 


City Latitude Longitude 
New York 41° —7A4° 
Rio de Janeiro —23° —43° 
Sydney —34° 151° 
Tokyo 36° 140° 
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Section 7.2.2, Theorem 6 


We ask you to prove the 
general case of the 
converse in Section 7.6. 


In metric units, the radius 
of the Earth is about 
6378 km. 


Here we take the unit of 
measurement for the 
radius $2 to be | mile. 


Q 


Q 


great circle 


In the expression cos d,d 
must be taken in radians, 
as usual. 


Remember that 
(geographers’ ) longitude 
West of the Greenwich 
meridian is negative. 
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Example 2 Estimate the distance between New York and Sydney, taking the 
radius of the Earth as 4000 miles. 


Solution The colatitudes of New York and Sydney are 90° — 41° = 49° and 
90° + 34° = 124°, so that the coordinates of the corresponding points P and 
Q on S$” are 
(cos(—74°) sin(49°), sin(—74°) sin(49°), cos(49°)) 
~ (0.2756 - 0.7547, —0.9613 - 0.7547, 0.6561) 
~ (0.208, —0.725, 0.656), 


and 


(cos(151°) sin(124°), sin(151°) sin(124°), cos(124°)) 
~ (—0.8746 - 0.8290, 0.4848 - 0.8290, —0.5592) 
~ (—0.725, 0.402, —0.559). 


Hence the distance between New York and Sydney is approximately 4000d 
miles, where 


cosd ~ —0.208 - 0.725 — 0.725 - 0.402 — 0.656 - 0.559 
~ —0.151 — 0.291 — 0.367 = —0.809. 


Thus d ~ cos~!(—0.809) ~ 2.513 radians, and the required distance 4000d 
~ 4000 - 2.51 ~ 10 050 miles. 


Problem 5 Estimate the distance between Tokyo and Rio de Janeiro, 
taking the radius of the Earth as 4000 miles. 


Right-Angled Triangles 


It only makes sense to discuss Pythagoras’ Theorem in those geometries witha Theorem 8 of Subsection 
distance. We have already seen Pythagoras’ Theorem in hyperbolic geometry, 6.4.3 
so what is the analogue in spherical geometry? 


Theorem 6 Pythagoras’ Theorem 
Let AABC be a triangle on S* in which the angle at C is a right angle. If a, 
b and c are the lengths of BC, CA and AB, then 


COSIGI— COS eGCOSiD: 


Proof We may rotate S? so that C is the North Pole N (0,0, 1) of S*,A lies Since the sphere has 


on the Greenwich meridian and has coordinates (sinb,0,cosb), and B has tadius I, at A,@ = b and 
@ = 0; and at B,p = F 


coordinates (0, sina, cos a). a5 
an =a. 


We then apply the rotation R(Y,—b) to S*. This maps A to C,C to some 
point C’, and B to some point B’. 
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Let a denote the angle 7CAB. Then the rotation R(Y,—b) maps the angle : 
ZCAB to the angle 7C’CB’, so that the angle ZC’CB’ = a:; it follows that the 
angle ZACB’ = x — a since ZACB’ + ZB/CC’ = mz. Hence the longitude of 
B’ is x — a. Also, the colatitude of B’ is 


length of CB’ = length of AB = c. 


So the coordinates of B’ are 


(cos(z — @) sinc, sin(z — @) sinc, cos c) 
= (—cosa@ sinc, sin@ sinc, cosc). (1) 


But we also have as coordinates of B’ 


cosb 0 —sinb 0 
R(Y, —b)(B) = 0) 1 0) sina 

snb 0 cosb cosa 

—sinbcosa 

= sina : (2) Here we use the formulas 
cosbcosa for coordinates under 
isometries given in 
Equating the third coordinates in (1) and (2), we get the formula Subsection 7.2.1. 


cosc = cosa x cosh, 


as required. | 


For example, if a = 1,b = 2 and ZACB = a8 then 
cosc =cos1 x cos2 ~ 0.5403 - (—0.4161) ~ —0.2248, 
so that c ~ 1.798. 


Problem 6 APOR is a triangle on S* in which the angle at R is a 
right angle and the lengths of PQ and PR are 1.7 and 1.9, respectively. 
Estimate the length of OR. 


Remark 


For all values of x, cosx = 1 — am higher powers of x; so, when x is small, 

cosx ~ 1— 5x. It follows that, if we are dealing with small triangles on S”, 
then the conclusion of Theorem 6 may be interpreted in the form 

120 12 1g 2\ ns 1,2 1,2 

1 = $e? = (1 ga?) - (1 45?) = 1 a? — 3B?, 


2 Here we drop the term in 


ab, since it is smaller 
so that c? ~ a?+b*. Thus Theorem 6 is a close approximation to the Euclidean than the remaining terms. 


version of Pythagoras’ Theorem in R?. 


The area formula for S* suggests that the angles of a spherical triangle deter- Theorem 3 
mine its size as well as its shape. In fact, any three pieces of the six possible 


Spherical Trigonometry 


pieces of information about a triangle (its three angles and the lengths of its 
three sides) determine the remaining three. 

Returning to the formula in the proof of Theorem 6 for the coordinates of 
B’ and equating the first and second coordinates, we obtain 


cosasinc =sinbcosa and = sinasinc = sina. 


Dividing the second of these by the first, we get tana = tana/sinb. 
Re-arranging the above formulas to make them easier to remember, we have 
the following result. 


Theorem 7 Let AABC be a triangle on S” in which the angle at C is a 
right angle. If a,b and c are the lengths of BC, CA and AB, and a denotes 
the angle ZC AB, then 


cosc = cosa x cosb, 


z sina tana 
sina = ——, and tana= 
sinc 


A similar discussion to that above gives that 
cosP sinc = sinacosb and sinf sinc = sinb, 


so that 
: sin b tanb 
sinf = —— and tanp= : 
sinc 


sina 

From these formulas and those in Theorem 7 above, it follows that in a right- 
angled triangle two further pieces of information about the triangle enable us 
to determine all its angles and sides. 


Problem 7 APOR is a triangle on S* in which the angle at R is a 
right angle and the lengths of PQ and PR are 1.7 and 1.9, respectively. 
Estimate the angle ZOPR. 


7.3.4 Spherical Trigonometry 


In this subsection we deduce various formulas of spherical trigonometry for 
strict triangles by dropping perpendiculars from a vertex of a spherical triangle 
to the opposite side, giving two right-angled ones for which we already have 
formulas. First, we verify that we can always drop a perpendicular from a point 
to a line. It follows that a spherical triangle has 3 altitudes. 

Given a strict segment AB and a point C on S? that form a triangle AABC, 
there is always a great circle through C perpendicular to AB. This great circle 
meets the great circle of which AB is part in two points, which we call D and 
D’. Since D and D’ are antipodal points, they are a distance apart, so one of 
three things must happen. 
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We prove this fact here for 
right angled triangles; a 
proof for general triangles 
can then be constructed by 
dividing a general triangle 
into two right angled 
triangles using an altitude. 


In the proof of Theorem 6 
and before Theorem 7 
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1. One of D and D’ coincides with A or B. The corresponding strict line 
segment is the perpendicular from C to AB. This case will not lead us to 
any new results, so we do not consider it further. 

2. One of D and D’, say D, lies between A and B. Then we call CD the 
perpendicular from C to AB; it lies inside AABC. 

3. One of D and D’, say D, is such that the distance AD is less than z and B 
lies between A and D. Then we again call CD the perpendicular from C 
to AB; in this case it lies outside AABC. 


Now consider a strict spherical triangle AABC, in which the sides AB, BC 
and CA have lengths c,a and b, respectively, and the angles at A, B and C are 
a, B and y, respectively. Drop the perpendicular CD from C to AB, and let d 
denote the length of CD. 

If we apply the second formula in Theorem 7 to triangle AADC we get 


so that sind = sina x sinb. Applying a similar argument to triangle ABDC 
we get sind = sinf x sina. 
Thus, sina x sinb = sin x sina, which is more memorable in the form 


sina sin 


sina sinb 


Then, by a similar argument involving the perpendicular from A to BC, we 
obtain the following result. 


Theorem 8 Sine Rule for a Strict Spherical Triangle 

Let AABC be a strict triangle on S?, in which the sides AB, BC and CA 
have lengths c, a and b, respectively, and the angles at A, B and C area, 6 
and y respectively. Then 


sina sinf  siny 


sina sin b sinc 


We can use an argument similar to that in the proof of Pythagoras’ Theorem 
to find further results. 

Let AABC be a strict triangle on S*. We may rotate S? so that C is the 
North Pole N(0,0, 1), A lies on the Greenwich meridian and has coordinates 
(sin b,0,cos b), and B has coordinates (cos y sina, siny sina, cosa). 

We then apply the rotation R(Y,—b) to S*. This maps A to C,C to some 
point C’, and B to some point B’. As we saw earlier, the coordinates of B’ are 
(— cos a@ sinc, sina sinc, cos c). 


Theorem 6 


In the proof of Theorem 6 
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But we also have as coordinates of B’ 


cosb O —sinb cos y sina 
R(Y, —b)(B) = 0 1 0 siny sina 
sinb 0 cosb cos a 


cosbcosy sina — sinbcosa 
sin y sina 


sinbcosy sina + cosbcosa 
Equating the third coordinates in the two expressions for B’ we get 


cosc = sinbcos y sina + cosbcosa. 


Theorem 9 Cosine Rule for Sides of a Strict Spherical Triangle 
Let AABC be a strict triangle on S?, in which the sides AB, BC and CA 
have lengths c,a and b, respectively, and the angles at A, B and C are a, 6 


and y, respectively. Then Notice that this formula 
F ; reduces to Pythagoras’ 
cosc = cosacosb + sinasinbcosy. Theorem when y is a 


right-angle. 


Example 3. Let AABC be a strict isosceles triangle on S*, in which the sides 
AB, BC and CA have lengths c,a and a, respectively, and the angles at A, B 
and C are a,a and y, respectively. Prove that 

tan (4c) 


(a) cosc = cos*a + sin? a cos y; (b) cosa = 
tana 


Solution Putting b = a in the conclusion of Theorem 9, we obtain the 
formula (a). 
Next, by applying Theorem 9 to find cos a, we get 


cosa = cosacosc + sina sinc cosa. 


We may then rearrange this formula to get 


cos a(1 — cosc) 


cosa = : : 
sina sinc 
_ cosa -2 sin (5c) _ tan (5c) Here we use the identities 
sina - 2sin (5c) cos ($c) tana * cos 2x = 1 —2sin? x 
as required. and 


sin 2x = 2 sin x cos x 


Problem 8 Let AABC be a strict equilateral triangle on S?, in which for x € R. 
the sides have length a and the angles are equal to a. Prove that 
cosa 


cosa = ———_. 
1+ cosa 
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We can now use any dual triangle AA’B’C’ of AABC to prove a new 
trigonometric result. The dual triangle has sides of length w — a,a — 6 and 
mz — y, and has angles of magnitude 7 — a,ma — b and x — c. By applying 
Theorem 9 to the dual triangle and using the formulas sin(a — x) = sinx and 
cos(z — x) = — cos x, we get 

— cosy = cosa cos f — sina sin B cos c; 


and we can reformulate this formula as follows. 


Theorem 10 Cosine Rule for Angles of a Strict Spherical Triangle 
Let AABC be a strict triangle on G2. in which the sides AB, BC and CA 
have lengths c, a and b, respectively, and the angles at A, B and C area, B 
and y, respectively. Then 


cos y = sina sin B cosc — cosa cos B. 


7.4 Spherical Geometry and the Extended Complex Plane 


Earlier we studied the Riemann sphere and stereographic projection of the 
sphere onto the extended complex plane C = C U {oo}. 


7.4.1 Stereographic Projection 


In this section we regard stereographic projection as a mapping z of the sphere 
S? onto an extended plane IR? U {oo}, under which the point (X,Y, Z) maps 


onto the point 7(X, Y, Z) = (42. 7) ; the inverse mapping is given by 


2x 2y een 


= 
Iv Xx, = > ? 
vo ae 


Example1 Let C denote a great circle on S* that is the intersection of S? with 
the plane aX + bY + cZ = 0, where et+b+ec=1,c # 0. 


(a) Determine the point N’ that is the image of N under reflection in C. 
(b) Determine the images of N’ and C in R? under stereographic 
projection z. 

(c) Verify that 2(N’) is the centre of the circle z(C). 
Solution 
(a) The point N’ has coordinates 

(0,0, 1) — 2{(0, 0, 1) - (a,b, c)}(a, b,c) 

= (0,0, 1) — 2c(a,b,c) = (—2ac, ee 2c*) 


You met dual triangles in 
Subsection 7.3.2. 


Subsection 5.2.4 


Here we denote the 
Riemann sphere by S? 
rather than S. 


Since c £ 0, C does not 
pass through the North 
Pole N of S?. 


We follow the approach of 
Subsection 7.2.2. 
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(b) From the formula for zr, the image of N’ under zr is 
g 


—2ac —2bec 
1— (1 —2c?)’ 1-— (1 — 2c?) 


m(—2ac, —2be, 1 — 2c”) = ( 


From the formula for 7 ~! the image 2(C) of C has equation 


2x +b 2y 4 Kooy = 1 
a (63 = 
x2+y2+] x274+y2+1 eyed 


we may rewrite this in the form 


2ax + 2by +(x? +y?—1) =0, 
or 


b 
ety? 422x422y-1=0. 
é c 


(c) Since the equation of the circle 7(C) may be written in the form 


a\2 b\? a+ 1 
(x+°) + yr) =l+—_,—-=s; 
Cc Cc (os Cc 


the centre of z(C) is the point (- — b), that is, the point z(N’). 


Cc? 


Problem 1 Let P = (x,y) be a point in R? other than the origin. 
Let P’ be the point on S* that corresponds to P under stereographic 
projection, P” the reflection of P’ in the (X, Y)-plane, and Q the point 
in R? that corresponds to P” under stereographic projection. 


(a) Determine the coordinates of P’, P” and Q. 
(b) Prove that P and Q are inverse points with respect to the unit circle 
x*+y? =1inR?. 


There is a simple and useful expression for the image under stereographic 
projection of a point on S* with given colatitude and longitude. 


Example 2 The point P’ on S* with colatitude 6 and longitude ¢ has coor- 
dinates (cos ¢ sin@, sing sin@,cos@). Prove that the image P of P’ under 


stereographic projection has coordinates { 2, 2 
tan( 56) tan( rv) 


Here we use the fact 


c #0. 


Again we use the fact 


c #0. 


Recall that the latitude of 


Piso’ =% 9. 
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Solution From the formula for z, the image P of P’ under z is the point 


a er) 


1—cos@’ 1—cosé 


cos @ -2sin (39) cos (58) sing - 2 sin (58) cos (39) 
2 sin? (56) , 2sin? (56) 


cos d sind 


tan (58) tan (58) 


as required. 


Next, we obtain a neat formula relating the distance on S* between the 
South Pole S and a point P’ on S? to the distance of the corresponding point 
P =7x(P’) from O in R?. 

Let P be a point (x, 0) in R?,x > 0. The corresponding point P’ = 2~!(P) 


2x x2-1 
14x?’ 0, x24] 


ical polar coordinates on S? as (cos @ sin@, sing sin@,cos@), where @ is 
colatitude, that the colatitude @ of P’ satisfies the equation 


on S” has coordinates ( ); it follows from the definition of spher- 


2x 
x2—1° 


tand = 


(1) 


Then if the spherical distance of P’ from S is d, the colatitude 6 of P’ also 
satisfies the equation 0 + d = zr, so that 


tan@ = —tand. (2) 


Then, if we denote by T the quantity tan (52), it follows from equations 


(1) and (2) that = i=- ate. We can write this equation in the form 
(T — x)(1+ xT) = 0; and, since both x and T are positive it follows that 
T = x. In other words tan~! x = tan7! T = sd, so that d = 2tan7! x. 

Since a rotation of S? and R? about the Z-axis does not change any of the 


distances in the above argument, we deduce the following result. 


Theorem 1 Let a point P’ on S* map under stereographic projection to a Here Q does not 
point P in R? with coordinates (r cos @,r sin@). Then the spherical distance represent colatitude. 
of P’ from the South Pole S is 2tan~!(r). 


Problem 2 Let P and Q be the points (2,2) and (3,3) in R*, and 
P’ and Q’ the corresponding points on S* under the mapping 2~!. Use 
Theorem | to determine the spherical distance P’Q’. 


Spherical Geometry and the Extended Complex Plane 


7.4.2 Conjugate Transformations 


We have seen a correspondence between points of the sphere S? and points of 
the extended plane. We now investigate the correspondence between spherical 
transformations and Mobius transformations. 

Earlier we saw that a spherical transformation is the composition of at most 
three reflections in great circles, and that the composition of any two reflections 
is a rotation. Here we shall see that reflection in a great circle corresponds to 
inversion in the image of that circle under stereographic projection, and that 
a rotation of S? corresponds to a particular kind of Mobius transformation. 
Thus we have two equivalent descriptions of spherical transformations, one on 
the sphere using 3 x 3 matrices, and one in the extended plane using Mébius 
transformations. 

Firstly, we establish the following pleasant result. 


Theorem 2 Let P’ and Q’ be points on S” that are mirror images under 
reflection in a great circle C’, and let the images of P’, Q’ and C’ under 
stereographic projection z be P, Q and C, respectively. Then P and Q are 
inverse points with respect to C. 


Proof The great circle lies in a plane, II say, through the origin. IT consists 
of those points in R? that are equidistant from the points P’ and Q’. So any 
sphere through P’ and Q’ has its centre somewhere in I, and so any circle 
on S* through P’ and Q’ also has its centre in I]. Thus any circle C” on S? 
through P’ and Q’ meets the great circle C’ at right angles. 

Under stereographic projection zr, the circles C’ and C” map to generalized 
circles; since stereographic projection is angle-preserving, these images meet 
at right angles. 

Now let C* be any generalized circle through P and Q. The image of C* 
' is a circle on S? through P’ and Q’; so a !(C*) is some circle 
C" on S? that meets C at right angles. Then, since stereographic projection is 
angle-preserving, it follows that C* meets C at right angles. 

But C* is any circle through P and Q, so any circle through P and Q meets 
C at right angles. It follows that P and Q are inverse points with respect to C, 
as required. | 


under 7 


Notice that, if C f and Cc are a great circle and a little circle on Ss? and 
C, and C2 are their images in the extended plane under zr, then there exists a 


Subsection 7.4.1 


Theorems 5 and 4 of 
Subsection 7.2.2 


Here we are using 
Theorem 4 of 
Subsection 5.5.2. 
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Mobius transformation that maps C; onto C2, but there cannot be a rotation 
of S? that maps C i onto C. Hence there are Mobius transformations of the 
extended plane that do not correspond to rotations of the sphere. 

Now, inversion in a generalized circle in the extended plane is necessarily 
a Mobius transformation composed with a complex conjugation. It follows 
from Theorem 2 that any spherical transformation corresponds to a Mébius 
transformation, composed (if need be) with a complex conjugation. 

So each rotation R of S* corresponds to some Mobius transformation of the 
extended plane by means of stereographic projection z. If P is a point in the 
extended plane, it follows that 7~!(P) is a point on S*; Ra! (P) is the image 
of that point after the rotation, and 7 Rx~!(P) is the projection of that image 
onto the extended plane. 

So, 7 Rx~! is a mapping of the extended plane to itself. We therefore make 
the following definition. 


Definition Let R denote a rotation of S*, and m stereographic projec- 
tion. Then the transformation 7 Rm~! of the extended plane is called the 
conjugate transformation of the rotation R. 


It follows from the above discussion that the image of the group of rotations 
of S* under z is a group of Mobius transformations. 


Lemma The conjugate transformation of a rotation of S* is a Mébius 
transformation of the extended plane. 


Indeed it can be proved that the mapping R +> 2Rx~! maps the group of 
rotations of S* isomorphically onto a group of Mobius transformations of the 
extended plane, but that it is not the whole group of Mobius transformations. 
We now characterize the Mobius transformations that are conjugate to rotations 
of S. 


Theorem 3 A Mobius transformation M that is conjugate to a rotation of 


S? is of the form M(z) = 2+, where a,b € C. 
—bz+4a 


Proof There is one family of rotations of S* that we can easily relate to 
Mobius transformations, namely rotations around the Z-axis. Their conjugate 
transformations are simply rotations about the origin through the same angle. 
Our strategy will be to use reflections of S? to reduce a given rotation of S? to 
one about the Z-axis. 

Let R denote a rotation of S* about the (diametrically opposite) points P and 
Q of S*, through an angle 6, and let r denote the reflection in a suitable great 
circle of S? that maps P to the North Pole N; so that r(P) = N. Then since r 
is a reflection, it is its own inverse, so that r(N) = P. Similarly r(Q) = S, the 
South Pole, and r(S) = Q. 
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By Theorem 10, 
Subsection 5.3.7 


For both are 
orientation-preserving 
mappings. 


This was proved in the 
earlier discussion. 


We omit a proof of 
this fact. 


We saw this in the 
discussion above. 


plane of reflection 


S 


So we shall often use r 
and r—! interchangeably 
below. 
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The spherical transformation r~! Rr is a direct isometry of S”, and so is a 
rotation of $7. Since 


(rR) (N) = (-'R) (r(N)) 
= (r-'R) (P) 
=r !R(P)=r |(P)=N, 


the point AN is a fixed point of the rotation; similarly S is also a fixed point. So 
r—!Rr is a rotation of S* about the Z-axis. 


It follows that the transformation x(r~'Rr)a~! is a rotation of the 
(x, y)-plane about the origin. Denoting this by M, it follows that M(z) = e!®z, 
for some real number 0. 

Now 


M=n (rR) a! 
=ar_! (x-'x) R (x-'x 


= (arn!) (Rx!) (ara); 


so that, since we know M, in order to know 2 Rx! it is now sufficient to find 


For 2 ~!s is the identity 
map of S* to itself. 
) rx! 


nie 
Since N and P map to each other under the reflection r in a suitable great 
circle of $*, the mapping zrz~! must be inversion in a circle in the plane. By Theorem 2 and 
Now the great circle of S* is the intersection of the sphere X7+Y7+Z*=1 Thar. 
with some plane aX + bY + cZ = 0; we may assume that a? + b* +c? = 1. This assumption will 
Since the plane cannot be vertical, c 4 0. simplify our calculations. 
By applying the formula for 2! to the equation of the plane, we see that 
the image of the great circle is a circle C in the (x, y)-plane with equation 


2ax + 2by +e (x? +y?-1) =0, 
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or 
>) a a b 
x+y +2-x+2-y—-1=0. 
c c 


b 


The circle C has centre (- a — 2), and the square of its radius is 


Next we express things in terms of complex numbers. Let a = —* — i%, so 
7 2442 ; ee : : 
that a&@ = |a|? = ow Now inversion in C is a mapping of the form 
Ljey 
aa (/c) ey 
Z-@ 
_ a% — |a|? + (1/c)? 
Z-a@ 
_ a+ 
z-a’ 
so that 
_ az+1 
arm '(z) = ———- 
Z-a@ 
Then 
_ ig OZ+1 
M(arn ')(z) aud: re ens 
Z-a 
so that 
-i0 | aztl 
Rx \( et | a) +1 
een a eid. ztl _ G 
Z—-a 


ae (az + 1) + (z—a@) 
~ 6-8 (@z+ 1) —a(z—a) 
Zz (lo|?e~/? + 1) + (ae? — a) 
~ z(@e~9 — &) + (e+ Jal?) 
z (|a|2e19/2 4 8/2) 4. (ye i9/2 — aei?/2)_ 
z (ae 19/2 — Gel) + (eH)? + |a|2e!8/2) ° 


thus wRa~!(z) = ae where a = |a|*e~!9/? + e/9/? and b = ae~'9/? — 


; bz+ 
ae!?/? as required. | 
Remark 
Notice that the Mébius transformations M(z) = ee conjugate to a rotation 


of S? are rather like the Mobius transformations representing direct hyperbolic 


transformations M(z) = gett, with |b| < Jal. 


For, a7 +b? +c¢* =1 


a is the centre of the circle 
in the plane. 


You met this formula in 
Theorem 3, Subsection 
5.2.1. 


For, —|a|? + (l/c)? = 1 


You met these Mébius 
transformations in 
Theorem 2, Subsection 
6.2.1. 


Spherical Geometry and the Extended Complex Plane 


Problem 3 The two fixed points of a rotation of S? are distinct from 
N and S. Show that they map under stereographic projection to points in 
C of the form s and —1/s. 


Example 3 Prove that a Mobius transformation M that is conjugate to a 


rotation R(Y, B) of S? is of the form M(z) = oe where a, b are real. 


Solution A rotation R(Y, f) of S? fixes the points (0, 1,0) and (0, —1,0), 
which map under stereographic projection to the points 7 and —i, respectively. 
By Theorem 3, a conjugate Mobius transformation must be of the form 


cz+d 
M(z) = —= = 
—dz+c 
this fixes i and —i if 
cit+d —cit+d 
i = —=—— and -i=-— 
—dit+c di+c 


which we can rewrite in the form 
d+di=cit+d and d—@Gi =—-ci +d. 


Adding these equations gives 2d = 2d, so that d is real, and subtracting the 
equations gives 2ci = 2ci, so that c is real. 

It follows that M has the desired form M(z) = we. with a in place of c 
and —b in place of d. 


Problem 4 Prove that a Mobius transformation M that is conjugate to 


a rotation R(X,a@) of S* is of the form M(z) = ae, where a,b are 
real. 


We end with some interesting observations concerning Mobius transforma- 
tions. A fixed point of a Mébius transformation M(z) = “+8, ad — bc # 0, is 


cz+d’ 


a point z for which M(z) = z, that is, tak = zorez?+(d—a)z—b=0. 
Since a quadratic equation can have at most two roots, it follows that a Mébius 


transformation can have at most two fixed points. 


Problem 5 Determine the fixed points in C and in C of the following 
Mobius transformations: 


zeeztlh zRe 


Example 4 Prove that a Mébius transformation M with distinct fixed points 
a and b in C maps the family < of Apollonian circles defined by the point 
circles a and b to itself. 
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We use c and d to avoid 
using the letters a and b 
for several different 
things. 
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Solution Let ¥ be the family of generalized circles through a and b. 

Since M has fixed points a and b, it maps the family & onto itself. Also, 
since a Mobius transformation preserves angles, M thus maps the family of 
generalized circles that is orthogonal to those in ¥ to itself. This family is the 
family ./ of Apollonian circles defined by the point circles a and b. 


Example 5 Prove that a Mobius transformation M with distinct fixed points 
a and b in C either maps each circle in the family & of Apollonian circles 
defined by the point circles a and b to itself or it maps no circle in & to itself. 


Solution By a suitable Mobius transformation f, we can map the point a to 
0 and b to oo. Then f maps the family </ of Apollonian circles defined by the 
point circles a and b to the family Z of concentric circles with centre 0. 

Then f o Mo f~! is a Mobius transformation, and it maps 0 to 0 and oo 
to oo. It follows that f o M o f~! is of the form z + Kz, for some non-zero 
complex number K. 

If |K| = 1, then f oMo f—! maps each concentric circle in F to itself; but 
if |K| 4 1 then f oMo f~! maps no circle in F to itself. 

If we then apply the mapping f~! to return to the original situation, the 
required result follows at once. 


7.4.3 


Stereographic projection enables us to interpret the theory of coaxal circles, 
including all its special cases, very elegantly on the sphere S?. 

Earlier you saw that, if C is the circle cut out on S? by the plane with 
equation aX + bY +cZ +d = 0, then stereographic projection maps C to: 


Coaxal Circles 


(a) the circle with equation 


Payne! gg” yg: eee 
- c+d ct+d° ctd ” , 
(b) the straight line with equation 
—d 
ax + by — =0, ife=—d. 


Problem 6 Prove that a circle in R? with equation 
x7 + y? 4+ 2ax + 2Byt+y =0 


corresponds under stereographic projection to the circle cut out on S$? by 
the plane with equation 


.eX 4 28V 20 = Viz 0s YW=d 
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A Mobius transformation 
maps generalized circles 
onto generalized circles. 


By Theorem 3, 
Subsection 5.5.1 


It is easy to verify that a 
Mobius transformation of 
€ that has 0 and 00 as its 
fixed points is necessarily 
of the form z+> Kz. 


You met coaxal circles in 
Subsection 5.5.2. 


In the proof of Theorem 7 
of Subsection 5.2.4 


If c = —d, the circle C 
passes through NV. 
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Definition The circle in R* with equation 
x? + y? 4+ 2ax + 2By+y =0 
and the plane with equation 
2aX +268Y +—-—y)Z+U+y)=0 


are said to be associated. 


We now use the idea of associated planes and circles to revisit the theory of 
coaxal circles. 
Let C; and Co be distinct circles in R? with equations 


x? + y? + 2ax + 2Biy +y1 =0 


and 
xe + y + 2a2x + 2Boy + yo = 0, 
respectively; then their corresponding associated planes zr and zz have equa- 
tions 
2a;X +261Y+(—-yw)Z+U+y) =0 
and 
202X + 2PoY + (1 —y)Z+ (1+) =0. 


Now, either 7; and zr2 are parallel or they are not parallel; they cannot be 
the same plane, since the associated circles are distinct. 

First, suppose that zr, and zr are not parallel, so that they have a common 
line, @ say. 

For arbitrary p and qg,p #4 —g, let Cpq be the generalized circle with 
equation 


P (x? + y? + 2arx + 261y + 11) 
pt+4q 
q 2. 32 = 
+—1_ (17+ y?+ 2aax + 2boy + 72) =0, 
pt+4q 


and let pq be its associated plane, with equation 


a] 
a (201 X +26¥+0-y)Z+04+n)) 


eee (202X + 2fo¥ +(1— yw) Z+ (1+ y2)) =0. 


The associated plane 77), passes through the line ¢; so, as p and q vary, Tpgq 
gives a family of planes through £. There are three cases to consider. 

Case 1: The line £ does not meet S*. 
Here the family of planes cuts out a family of circles in S* consisting of two 
point circles P and Q and a set of disjoint circles in between these. 

Under stereographic projection from N, the associated circles in the plane 
form an Apollonian family: a family of coaxal circles with two point circles. 
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That is, they correspond to 
each other under 
stereographic mapping. 


We allow C; and C2 to be 
point circles. 
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Case 2: The line £ touches S? (at the point P, say). 

Here, every plane of the family meets S*. The plane tangent to the sphere at 
the point P meets it ina single point, P. All the other planes cut S? in a circle. 
All these circles have a common tangent at P, which is the line ¢. 

Under stereographic projection from N, the associated circles in the plane 
form a family of coaxal circles having a common tangent. If N and P are 
distinct, the image circles are ‘true’ circles except for one straight line; if NV 
coincides with P, the images are parallel lines. 

Case 3: The line £ cuts S* in two points, say P and Q. 

Here, every plane of the family meets S*. Each plane cuts the sphere in a circle 
passing through the points P and Q. 

Under stereographic projection from N, the associated circles in the plane 
form a family of coaxal circles with two common points. If NV, P and Q are 
distinct, the image circles are ‘true’ circles except for one straight line; if NV 
coincides with either P or Q, say P, the images are straight lines passing 
through the image of Q. 

On the other hand, if the associated planes zr; and zp are parallel then the 
family of planes z pq consists of the planes parallel to 71 and 2, and the circles This is similar to the 
they cut out on the sphere consist of two point circles and a set of disjoint — situation in Case | above. 
circles in between these. 


7.5 Planar Maps 


We use maps of the Earth on a daily basis: for example. street maps of towns, 
road maps of countries, geological maps of continents, and for land, sea and 
air travel. Most of our maps are on flat sheets of paper, rather than globes; and 
we have to reduce the actual dimensions by a convenient scale factor, such as __ Here, scale factor = 
1:12,000,000 for a continent, 1:100,000 for a map of a large city, or 1:25,000 length on map divided by 
for a detailed neighbourhood map. neon airs eaeaees 
We will assume in this section that the Earth is a sphere. In fact, the shape 
of the Earth is closer to that of an oblate spheroid, a sphere flattened near each 
pole and with a bulge around the equator. This bulge results from the rotation 
of the Earth, and causes its equatorial diameter to be over 26 miles (43 km) 
greater than its pole-to-pole diameter 
Now an ideal planar map would represent shapes on the Earth’s surface 
in such a way that relative distances, relative areas, shape, and angles are 
preserved under the operation of mapping. Unfortunately, no ideal map is 
possible! 
For example, let AABC be a triangle on S, and let P Q, R be the points 
on the planar map that correspond to A, B, C, respectively. Then the sides of Subsection 7.3.1, 
AABC are line segments on $7, that is, parts of great circles; and the sum of | Theorem 2 
its angles exceeds zr. The sides of APQR are line segments in the plane, and 
the sum of its angles is 7. So we cannot simultaneously have geodesics on S* 


Planar Maps 


mapping onto geodesics in the plane AND preserve angles. This shows that an 
ideal planar map cannot exist. 

Furthermore, Euler showed in 1775 that it is not possible to map a portion 
of the Earth’s surface onto a planar map in such a way that relative distances 
(that is, distances up to a scaling factor) are preserved. 


To see this, let E denote the surface of the Earth, a sphere with centre O and 
radius R; and let P’ € E be the centre of a circle C’ on E of radius r (we will 
suppose that r < 57 R, so that C’ lies in a hemisphere on E with P’ as pole). 
Suppose that some ideal mapping represents the hemisphere onto a flat sheet, 
x say, and let P C on z represent P’, C’ on E. 

Since the mapping scales all distances by the same scale factor, c say, all 
points on E at a distance r from P’ map to points at a distance cr from P; 
hence C’ is represented on x by a circle C of radius cr — so that the length of 
C is 20cr. 

Next, let Q’ be any point on C’, and N’ the foot of the perpendicular 
from Q' to OP’. Then ZP’OQ’ = %, so that Q'N’ = Rsin(ZP'0Q’) = 
R sin (z)- But the circle C’ is a circle in R* with radius Q’N’, so that its 
length is 27 R sin (4). Hence its representation C on z must have length 
c x 27 Rsin (4). 

We deduce from the last two paragraphs that 


2acr = 2mcR sin (=) . 
R 
so that 


sins < | for all x withO <x < 4. 


So we have proved that it is not possible to represent a portion of the Earth’s 
surface on a planar map in such a way that relative distances are preserved. It 


This equality is impossible, since 


follows from this, too, that no (precisely accurate) planar map of a portion of 
E can have a fixed scale factor for the whole map. 

In fact, there are three principal types of map projections for representing 
the spherical Earth, E, on a flat sheet of paper, z. In their work, cartographers 
often use the term meridians to denote lines of latitude, and parallels to denote 
lines of longitude: we will follow this convention here. 
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Recall that ‘geodesics’ are 
curves of shortest length 
between two points. 
Leonhard Euler 
(1707-1783) was a 
prolific Swiss analyst, 
who worked for Frederick 
the Great (Prussia) and 
Catherine the Great 
(Russia) even after 
becoming blind. 


Here r is a distance 
measured along the 
surface. 


This is a standard result in 
Calculus. 


In addition there are 
various purely 
mathematical approaches, 
for special purposes. 
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Azimuthal Projections 


Azimuthal projections of E onto z are ones where we project E onto a sheet 
z that is tangent to FE at some point, often the South Pole S. We can think of the 
projection mapping in the following way: a light source placed at some point 
(the centre of the projection) on the perpendicular to z at the point of contact 
projects each point P’ on E onto a corresponding point P on 2. Wherever we 
place the source, meridians map to radial lines from the point of contact S, 
parallels to concentric circles with centre S, and directions from the point of 
contact are preserved. 


If the source is placed at the centre of E, the projection is called the gnomic 

projection, and we obtain a map of the lower hemisphere of E. Since any plane By making the tangent 
through O meets E ina great circle and z ina line, each line in z comes from point NV, we could obtain 
a great circle on E. Hence geodesics on E are mapped onto straight lines on 7. ™aP Of the upper 

If the light source is placed at the North Pole N of E, we obtain a map of the nee: 
whole of E—{N} on z. This projection is called the stereographic projection of 
E onto zr, and is the stereographic projection that you met earlier. It preserves 
angles, and so is called a conformal projection. 

If the light source is placed ‘at infinity’ along the direction SON, and we 
consider only the upper hemisphere or the lower hemisphere of E, then we 
obtain a map of one hemisphere of E onto z that is actually circular. This 
projection is called the orthogonal or orthographic projection. It preserves 
distances along parallels. 


Subsection 5.2.4 


Cylindrical projections 
Cylindrical projections of E onto z are ones where we project E onto a sheet 


x that is rolled in the form of a cylinder round EF, touching it at its Equator; 
then we slit the sheet and flatten it. 


a 
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In one cylindrical projection, a light source placed at the centre O of E, the 
centre of the projection, projects each point P’ on E onto a corresponding point 
P on. Parallels map to horizontal circles that become horizontal lines when 
we unwrap the sheet, and meridians to vertical lines. The equator is mapped to 
itself, and areas near the North and South Poles become greatly enlarged. 

Another cylindrical projection projects each point P’ on E horizontally 
onto the nearest point P on zr. Again, parallels map to horizontal circles that 
become horizontal lines when we unwrap the sheet, and meridians to vertical 
lines. This projection preserves the relative sizes of areas, so it is called an 
equal area projection. 

The well-known Mercator projection is a cylindrical map where the paral- 
lels are horizontal and the meridians vertical, but the spacing of the parallels is 
chose cunningly so that paths of constant compass bearing (that is, paths mak- 
ing the same angle with every meridian they cross) are shown as straight lines. 
This projection is invaluable in navigation, since sailors and aircrew using it do 
not need to constantly change the compass bearing of the course they follow. 


Conical projections 


Conical projections of E onto z are ones where we project E onto a sheet 7 
that is rolled in the form of a cone round E, touching it at some parallel; then 
we slit the sheet along a meridian and flatten it, taking on the shape of a sector 
of a circle. 


We may, for example, project each point P’ on E to the corresponding point 
P on az radially out from the centre of E, or horizontally out from the axis 
of the cone to the nearest point on z. Parallels map to arcs of circles whose 
centres are at the vertex V of the flattened cone, and meridians to straight lines 
radially out from V. Conic projections are particularly useful for mapping large 
areas in the mid-latitudes that are wide from east to west (such as Canada). 

There are many variations of these fundamental types, all designed for 
special purposes; but a full discussion is beyond the scope of this book. 
Map-making is a fascinating subject! 
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Gerardus Mercator 
(Latinized form of Gerard 
Kremer) (1512-1594) was 
a Flemish map maker who 
devised this projection in 
1569, and introduced the 
term ‘atlas’. 


You should look at a good 
Atlas now to see the 
various types of 
projections that it uses. 
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7.6 Exercises 


Section 71 
1. (a) Determine the ae of the great circle on S? that passes through the 
2 12 1 
points (Je, —Jg, Je) and (—e. es Je): 


(b) Determine the seca of the little circle on S* that passes through the 


points (2, 5 Tei =), (0,0, 1) and (=, 3 Ye’ +2). 


V3 _v3 _1 ) 
2. Determine the spherical polar representation of the point (4 ee bat 2 


on S*. Hence state the latitude, colatitude, and longitude of this point. 
3. Determine the coordinates on S? of San Francisco (38° N, 123° W) and 
Brisbane (27° S, 153° E). 


Section 72 


. é . 3 4 1 
1. Determine the images of the point (35 578 -+;) 
R(X, 3), R(Y, 32) and R(Z, 37). 


2. Determine the matrix A of a rotation x+> Ax of S? that maps 
(Saxe va) © (sea): 
5/2" 5/2’ /2 5/2” afi J2 
calculation. 
3. (a) Determine the matrix A such that the mapping r : x + Ax represents 
reflection of S* in the plane z with equation x — 2y +z = is 


J14’ Fa 14° /14 Fa): 
(c) Prove that the midpoint of P and r(P) lies in the plane z. 


4. Let a triangle APOR on S” have sides PQ of length r, QR of length p, This is the converse of 
and RP of length g, and angles a, 6 and y at the vertices P,Q and R, Theorem 6 of Subsection 
respectively; let a = 6. Prove that p = q. 7.2.2. 

5. Express the rotation R(X, 7) of S? as the composition of two reflections 
of S? in great circles. 


under the mappings 


Verify your answer by direct 


(b) Hence determine the image under r of the point P (4 


Section 73 


1. Construct the following figures on S*: 
(a) aspherical triangle of area (27 — 0.1); 
(b) a lune of area 7/3. 

2. An equilateral strict spherical triangle has area 75. Determine: 
(a) the magnitudes of its angles; 
(b) the lengths of its sides; 
(c) the area of a dual triangle. 

3. Estimate the distance between Beijing (40° N, 117° E) and Lima (12° S 
77° W), taking the radius of the Earth as 4000 miles. 

4. Let AABC be a triangle on S* in which the angle at C is a right angle. If 
a, b and c denote the lengths of BC, CA and AB, and a and 6 denote the 
angles ZCAB and ZCBA, prove that: 
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(a) cosc = cota x cot B; 
(b) cos 8B = tana x cotc = cosb x sina; 
(c) sina = tanb x cot B = sina x sinc. 

. Let the spherical triangle AABC have sides AB, BC and CA of lengths 1, 
0.6 and 1.3, respectively. Estimate the length of the perpendicular from B 
to CA. 

. Let the figure ABCD on S? be a strict spherical ‘square’ — that is, have 
four sides of the same length r and four angles of the same magnitude a. 
Determine r in terms of a. 

. Prove that the distance d on S* between the points with spherical polar coor- 
dinates (cos ¢; sin 6), sin dg; sin6;,cos@;) and (cos @2 sin 62, sin @2 sin 62, 
cos 62) is given by. 


cos d = sin 0, sin 62 cos(¢, — $2) + cos 6; cos 62. 


Section 74 


1 


Let P and Q be the points (4. 3) and (-1. -t) in R?, and P’ and Q’ be 

the corresponding points on S* under the mapping ~!. Use Theorem 1 of 

Subsection 7.4.1 to determine the spherical distance P’Q’. 

. Determine the Mobius transformation that is conjugate to the rotation 

R(Y, F) of S?. 

. (a) Prove that a Mébius transformation that maps 0 to co and oo to 0 is of 
the form z > x, for some non-zero complex number K. 

(b) Prove that a Mobius transformation M that maps a to b and b to a 
(where a 4 b) maps the family </ of Apollonian circles defined by the 
point circles a and b to itself, and maps exactly one circle in to itself. 


Summary of Chapter 7 


Introduction 


The (Elliptic) Parallel Postulate Given any line @ and a point P not on @, all 
lines through P meet @. (That is, there are no lines through P that are parallel 
to £.) 


Section 71: Spherical Space 


1. The unit sphere in R? is S? = {(x, y,z) : x? + y? +2? = 1}. Each point 
(x, y,z) on S* has an antipodal point (—x, —y, —z). The North Pole of 
S? is N (0,0, 1), and the South Pole is S (0,0,—1). The equator is the 
intersection of S* with the horizontal plane through the origin. 

2. A great circle is the circle cut out on S? by a plane through the centre 
ore 
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A ‘strict’ spherical square 
is one whose whole can be 
seen at one time after a 
suitable rotation of S2. 


Sometimes this formula 
can save you a bit of 
arithmetic. 
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A circle of longitude is a great circle through N. The Greenwich 
Meridian is the part of the circle of longitude through A (1,0,0) which 
has positive x-coordinates. 

A circle of latitude is the curve of intersection of S? with a plane par- 
allel to the equator. A little circle is the curve of intersection of S? with a 
plane that is not a great circle. 

3. A line on S? joining two points P and Q, or a line PQ, is either arc with 
endpoints P and Q of a great circle through P and Q. 

4. The length of a line on S? is the Euclidean length of the corresponding 
circular arc; the distance (or spherical distance) between two points P 
and Q on S? is the length of the shorter of the two lines PQ, and is often 
denoted simply as PQ. 

For any three points P, Q and R on S? that do not lie on a single great 
circle, the lines PQ, OR and RP form a triangle APOR. 

5. Great circles are geodesics on S 2. that is, curves of shortest distance 
between points. 

6. The distance between any two points P and Q on S? is equal to the angle 
that they subtend at the centre of S*. There is always a great circle joining 
any two points on $7. 

Spherical distance possesses all the natural properties that a distance 
function should possess. 

7. Any two lines on S* meet in two points. 

8. Corresponding to each point P on S* the great circle associated with P 
is the great circle of points 2 / 2 distant from P. Conversely, for each great 
circle there are two points of S* which play the role of its ‘poles’. 

9. For any two points P and Q on S”, there is a rotation of S* that maps P 
to Q. 

10. Each point P on S* has spherical polar coordinates (cos ¢ sin, sin 
sin9,cos@), where 0 < @ < 27 andO <6 <7. 

The colatitude of P is the angle 6; the latitude of P is 0’ = > —@. The 
longitude of P is the angle ¢. 

The distance of P from the North Pole is its colatitude 6. 

11. A circle on S? is the set of points on S? that are equidistant from a given 
point of S*; it is either a little circle or a great circle. 


Section 72: Sperical transformations 


1. An isometry of S* is a mapping of S* to itself that preserves distances 
between points. The isometries of S* form a group, the group of spherical 
isometries, denoted by the symbol S (2). 

A direct isometry of S* preserves the orientation of angles on S*; an 
indirect isometry of S* reverses them. Reflection of S? in a plane through 
the centre of S* is an indirect isometry of S*. 

2. The three elementary rotations of S? are its rotations about the coordinate 
axes. 


Summary of Chapter 7 


The transformation 
RZ. SSS 


xbh> Ax 
x cosy —siny 0 
where x = | y | and A= | siny cosy 0 | is a rotation of S* 
z 0 0 1 


about the positive z-axis through an angle y. 


The transformation 
RiXa? Fae 


xh Ax 
x 1 0 0 
where x = | y | and A = | 0 cosa —sina | is a rotation of S* 
z 0 sine cosa 


about the positive x-axis through an angle a. 


The transformation 
R(Y,p): Ss 8? 


xh Ax 
x cosB OO sing 
where x = | y | and A = 0 1 0 is a rotation of S? 
v4 —sinB O cosp 


about the positive y-axis through an angle f. 


. The inverse of an elementary rotation through a given angle is an ele- 


mentary rotation through the same angle but in the opposite direction. Its 
matrix is the transpose of the matrix of the original elementary rotation. 


. Arotation of S? that maps A (1,0,0) to P (cos ¢ sin @, sin ¢@ sin 8, cos 8) is 


given by the composition R (Z,@) R iY -6’), where 6’ = . — 6. 


. The equatorial plane of S? is the plane z = 0 that contains the equator 


of S*. Reflection in the equatorial plane of S? is the mapping x +> Ax 
that sends the point x = (x,y,z) to the point (x, y,—z); it has matrix 


1 0 0 
A=] 0 1 0O 
0 0 -!1 


. Reflection of S? in the plane with equation ax + by + cz = 0, where 


a” +b? + c* = 1, is given by the mapping x +> Ax where 
1—2a? -2ab = —2ac 

A=|{ -2ab 1-2b?  —2be 
—2ac —2be 1-2? 


. The product of any two reflections of R? in planes through O that meet in 


a common line is a rotation about that common line. 


. Every rotation is a product of two reflections; these reflections are in 


planes that have the axis of rotation as their common line, and are 
separated by half the angle of the rotation. 


. The image of a circle on S? under an isometry of S* to itself is a circle. 
. The Product Theorem Every isometry of S* is a product of at most three 


reflections in great circles. 
An isometry of S$? is known completely when its effect on three points 
(which do not lie on a common great circle) is known. 
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11. Isosceles Triangle Theorem Let a triangle APOR on S” have sides PQ 
of length r, QR of length p, and RP of length q, and let p = q. Also, let 
the angles at the vertices P,Q and R be a, # and y, respectively. Then 
a= B. 

12. Every isometry of S* is a reflection, a rotation, or the composite of a 
reflection and a rotation. 

Every direct isometry of S* is a rotation, and vice versa. 


Section 7.3: Spherical Trigonometry 


1. Given any three angles a, 6 and y with (S57 >)a+6+y > a, there 
exists a spherical triangle with those angles. 

2. The sum of the angles of any spherical triangle is greater than 2. The 
angular excess of a spherical triangle is the difference between the sum 
of the angles of the triangle and z. 

3. The strict line segment PQ is the shorter of the two line segments PQ. 
(Uf both are of length 7, we choose either in some explicit way.) 

The strict triangle APQR is that triangle APOR whose sides are strict 
line segments, and its inside is the region of S? that they bound; we can 
rotate S* so that the whole of the strict triangle can be seen at one time. 

4. A lune is a 2-sided polygon bounded by two great circles 
. The area of a spherical triangle is equal to its angular excess. 

6. Dual Triangles Theorem With every strict spherical triangle A there is 
associated another triangle A’, called its dual, whose angles are the com- 
plements of the sides of A and whose sides are the complements of the 
angles of A. (The ‘complement’ of a quantity x is the quantity z — x.) 

7. The distance d on S* between the points (pj, p2, p3) and (q1,q2,q3) is 
given by cosd = p1qi+p2q2+p3q3. The distance between corresponding 
points on the surface of the Earth is then given (in miles) by the scaling 
map x +> 4000x. 

8. Pythagoras’ Theorem Let AABC be a triangle on S? in which the angle 
at C is aright angle. If a, b and c are the lengths of BC, CA and AB, then 
cosc = cosa x cosb. 

Also sina = sina /sin c and tana = tan a/sin b. 
9, We can always drop a perpendicular from a point on S? to a line on S?. 
10. Let AABC be a strict triangle on S*, in which the sides AB, BC and CA 
have lengths c, a and b, respectively, and the angles at A, B and C are 
a, B and y, respectively. Then 

Sine Rul sna — sng _ ny, 

Cosine Rule for Sides cosc = cosacosh+ sinasinbcos y ; 

Cosine Rule for Angles cosy = sina sin 6B cosc — cosacosB . 
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Section 74: Spherical Geometry and the Extended Complex 
Plane 


1. The image of the point P (cos ¢ sin @, sin ¢ sin @, cos @) under stereographic 
projection z has coordinates (cos d [tan (58). sin o/ tan (58)). 


Planar Maps 


2. 


Let a point P’ on S* map under stereographic projection to a point P in R? 
with coordinates (r cos @,rsin@). Then the spherical distance of P’ from 
the South Pole S is 2tan~! (r). 
Let P’ and Q’ be points on S* that are mirror images under reflection in 
a great circle C’, and let the images of P’, Q’ and C’ under stereographic 
projection z be P, Q and C, respectively. Then P and Q are inverse points 
with respect to C. 
Let R denote a rotation of S*, and x stereographic projection. Then the 
transformation 2 Rz~! of the extended plane is the conjugate transfor- 
mation of the rotation R. 

The conjugate transformation of a rotation of S? is a Mébius transforma- 
tion of the extended plane. 
A Mobius transformation M that is conjugate to a rotation of S? is of the 
form M (z) = (az+b) /(—bz +a), where a,b € C. 

A Mobius transformation can have at most two fixed points. 
The circle in R* with equation x? + y* + 2ax + 2By + y = O and the 
plane with equation 27X + 26Y + (1—y)Z+(.+y) = Oare said to be 
associated. 


Section 75: Planar Maps 


1. 


It is not possible to map a portion of the Earth’s surface onto a planar map 
in such a way that relative distances are preserved. There is no ideal planar 
map. 


. Azimuthal projections of the Earth E onto a flat sheet of paper z are ones 


where we project E onto mz placed tangent to E at some point, often the 
South Pole S. 

The gnomic projection uses the centre of E as the centre of projection, 
stereographic projection uses the North Pole of E as the centre of pro- 
jection, and orthogonal or orthographic projection uses rays from infinity 
parallel to the polar axis. 


. Cylindrical projections of E onto z are ones where we project FE onto a 


sheet 7 that is rolled in the form of a cylinder round E, touching it at its 
Equator; then we slit the sheet along a meridian and flatten it. 


. Conical projections of E onto z are ones where we project E onto a sheet 


z that is rolled in the form of a cone round E, touching it at some parallel; 
then we slit the sheet along a meridian and flatten it, taking on the shape of 
a sector of a circle. 
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0 The Kleinian View of Geometry 


We now describe in detail what we have called the Kleinian view of geometry. 
As we said earlier, this is a powerful way in which the geometries we have 
presented can be unified and seen as different aspects of the same idea. 


8.1 Affine Geometry 


Klein argued that in any geometry there are properties of figures that one dis- 
cusses, but these properties vary from one geometry to another. In Euclidean 
geometry, segments have lengths, lines cross at angles one can measure; it 
makes sense to ask if a given triangle is equilateral, and so on. But none of 
these properties makes sense in affine geometry. There one may speak only of 
ratios of lengths along parallel lines, and all triangles are (affine) congruent. 
Klein interpreted this as follows: in each geometry we are studying the same 
space of points, the familiar plane R*. In Euclidean geometry we admit only 
the transformations that preserve length. These are transformations of the form 


t(x) = Ux+a, 


where U is an orthogonal 2 x 2 matrix anda € R?. 

Because every transformation in Euclidean geometry preserves lengths, and 
therefore angles, it makes sense to speak of distance in this geometry. 

In affine geometry, however, we admit all transformations of the form 


t(x) = Ax+b, 


where A is an invertible 2 x 2 matrix and b € R?. Because every transformation 
in affine geometry preserves ratios of lengths along parallel lines, it makes 
sense to speak of such ratios in this geometry. But because there are affine 
transformations that do not preserve length, it does not make sense to speak of 
distance in this geometry. 

Klein observed that rather more is true. In both geometries, the allowable 
transformations form a group. This was a novel observation at the time, for 
Group Theory as such was only just being discovered. Moreover, the group of 
Euclidean transformations is a subgroup of the group of affine transformations. 
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Chapter 0 


Affine Geometry 


This observation enabled Klein to unify the two geometries. He said that both 
geometries concern the same space R?, that each geometry involves a group of 
transformations which act on this space, and that the Euclidean group is a sub- 
group of the affine group. In this sense Euclidean geometry is a subgeometry 
of affine geometry. 

The hard work comes with the remaining geometries. How can affine geom- 
etry and projective geometry be unified? Projective geometry is not about the 
plane R? but about projective space RP”. Its transformations are written as 
3 x 3 homogeneous matrices, not as 2 x 2 matrices. To accomplish the uni- 
fication, we must first establish that the space IR? can be regarded as a subset 
of RP’. 

To do this, we consider the subset of Points in RP? that pierce the standard 
embedding plane z = 1. We shall temporarily denote this subset by RP%, 
so that 


RP} = {[x, y,z] : z 4 O}. 


Now each point (x, y) in R* can be identified with the Point [x, y, 1] in RP?: 
and, conversely, each Point [x, y,z] = [x/z, y/z, 1] in RP5 can be identified 
with the point (x/z,y/z) in R?. This provides us with a one-one corre- 
spondence between IR* and RP5 which enables us to regard R? as a subset 
of RP*. 

Next (following Klein) we prove a result which shows that the affine group 
can be considered as a subgroup of the projective group. 


Theorem 1 The group of affine transformations can be identified with the 
subgroup of projective transformations that map RP onto itself. 


Proof A projective transformation which maps RPs onto itself is also 
one that maps the ideal Line z=0 onto itself. It is therefore sufficient to 
characterize the projective transformations which map ideal Points to ideal 
Points. 

It is clear that the set of projective transformations which map the Line z = 0 
onto itself is a group. For, if ¢ is a transformation which maps ideal Points to 
'. and, if t and s are projective transfor- 
mations which map ideal Points to ideal Points, then so is t os. We denote this 
group by Po(2). 


ideal Points, then so is its inverse f~ 


abe 
LettA=|d ef } bea matrix associated with a projective transforma- 
g hk 


tion t. Then f maps ideal Points to ideal Points if and only if 


a b c\ (x 
de f}jj{y}={?], 
g h k/ \O 
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Projective 


Here, homogeneous 
means that the matrix can 
be multiplied by any 
non-zero real number 
without affecting the 
transformation. 


Recall that the group of all 
projective transformations 
is denoted by P(2). 
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where we do not care what numbers the ?’s stand for, so long as they are not 
both zero. It follows that t maps ideal Points to ideal Points if and only if 


gexthy+0=0. 


This must hold for all x 4 0 and all y 4 0, so g = h = 0. Since A is invertible, 
it follows that det A = k(ae—db) # 0, sok 4 Oand (ae—db) # 0. Moreover, 
since the matrices associated with a projective transformation are determined 
only up to a non-zero real multiple, we can divide A by k. If we then rewrite 
a/k as a, b/k as b, and so on, we see that the transformations in Po(2) are 
precisely those projective transformations which have an associated matrix of 


the form 
abe 
de f |, whereae—db#<0. 
0 0 1 


Let us now consider the effect that such a transformation has on Points in RP%. 
If [ae Jy, 1] is the image of [x, y, 1] under the transformation, then 


x! ab c\ (x ax +by+c 
yJ=(4 e fl lypalaxteyt+f 
1 0 0 1 1 1 


This equation provides us with a relationship between x, y and x’, y’ which we 
can rewrite in the form 


HY) fa bY (x aa 
vy} \ad e}\y ‘ea 
Since ae—db 0 it follows that we can map the point (x, y) associated with 
the Point [x, y, 1] to the point (x’, y’) associated with the Point [x’, y’, 1] by an 


affine transformation. We conclude that the group Po(2) may be identified with 
the affine group A(2). | 


It follows that the space R? of affine geometry can be considered as a subset Catfine > 
of projective space RP?, and that the group A(2) of affine geometry can be 
considered as a subgroup of the projective group P(2). In other words, affine 
geometry is a subgeometry of projective geometry. 

It is important to realize that in the process of extracting affine space from 
projective space we lose all the ideal Points on the ideal Line, and that this 
omission can have a significant effect on figures. For example, in projective 


Projective 


geometry any two Lines meet at a Point. But if the Point of intersection is an 
ideal Point, then the corresponding affine figure consists of two lines that do 
not meet. This gives rise to the affine concept of parallel lines, which is mean- 
ingless in projective geometry. The fact that ‘parallelism’ is an affine property 
can be checked easily by recalling that an affine transformation corresponds 
to a projective transformation which maps ideal Points to ideal Points. Under 
such a transformation, Lines which intersect at an ideal Point map to Lines 
which intersect at an ideal Point. This is an alternative proof of the following 
theorem that we first proved in Chapter 2. 


Affine Geometry 


Theorem 2 An affine transformation maps parallel straight lines to paral- 
lel straight lines. 


Parallelism is not the only new property to arise from the omission of ideal 
Points. In affine geometry, three distinct types of (non-degenerate) conic arise 
because there are three ways in which a projective conic can meet the ideal 
Line. In the first figure below, the projective conic and the Line cross at two 
Points; the corresponding picture in R? is that of a hyperbola. In the second 
figure, the projective conic and the Line touch at one Point; the corresponding 
picture in R? is that of a parabola. In the third figure, the projective conic and 
the Line do not meet at any Point; the corresponding picture in R? is that of an 
ellipse. 


two 
ideal 
Points 


So hyperbolas, parabolas and ellipses are plane conics which correspond 
to projective conics with two, one and no ideal Points missing, respectively. 
Since affine transformations correspond to projective transformations which 
map ideal Points to ideal Points, they cannot change the number of ideal Points 
on a projective conic, and hence cannot change the type of the plane conic. This 
is an alternative proof of the following theorem first proved in Chapter 2. 


Theorem 3 Affine transformations map ellipses to ellipses, parabolas to 
parabolas, and hyperbolas to hyperbolas. 


So far, we have discussed some affine properties that do not make sense in 
projective geometry. By contrast, any property in projective geometry must 
also be a property in affine geometry. Indeed, any property that is preserved 
by the group of all projective transformations must also be preserved by the 
subgroup of all affine transformations. 

For example, tangency is a property of projective conics, so every pro- 
jective transformation maps tangents to tangents. In particular, a projective 
transformation which corresponds to an affine transformation must map a tan- 
gent whose Point of contact is not an ideal Point to a tangent whose Point of 
contact is not an ideal Point. Interpreting this observation as an affine result, 
we obtain the following theorem which we first met in Chapter 2. 


Subsection 2.3.3, 
Theorem 3 


Subsection 2.5.1, 
Theorem 4 
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Theorem 4 Let f be an affine transformation, and let £ be a tangent toa —- Subsection 2.5.1, 
conic C. Then f(€) is a tangent to the conic r(C). Theorem 7 


Notice that in this proof of the theorem we did not have to consider tangents 
whose Point of contact is an ideal Point. But to what does such a tangent 
correspond in R*? The answer depends on how many ideal Points lie on the 
projective conic. 
A projective conic which passes through one ideal Point corresponds to a 
parabola in R. Since such a projective conic meets the ideal Line at precisely 
one Point, the tangent at that Point must be the ideal Line z = 0. This tangent Recall that a Line is a 
does not correspond to anything in R*! tangent to a projective 
More interesting are those projective conics which pass through two ideal iC if it meets the conic 
Points. Such a projective conic corresponds to a hyperbola in R?. The tangents at yncney one Pome 
which touch the projective conic at its two ideal Points correspond to lines in 
IR’, and these lines are defined to be the asymptotes of the hyperbola. 


standard 
embedding 
plane z= | 
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tangents at the 2 
ideal Points of the ae 


projective conic 


Since affine transformations correspond to those projective transformations 
which map ideal Points to ideal Points, it follows that tangents at ideal Points 
map to tangents at ideal Points, and hence that asymptotes map to asymptotes. 


This proves the following theorem which you met in Chapter 2. Subsection 2.5.1, 
Theorem 6 


Theorem 5 Let ¢ be an affine transformation, and let H be a hyperbola 
with asymptotes £; and £2. Then t(H) has asymptotes ¢(€1) and t(€2). 


We now summarize what we have done so far. 
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Geometry Space Transformations Invariants 
Projective RP* = R? ¢: [x] [Ax], incidence, 
+ ideal Line where A is a3 x 3 collinearity, 
invertible matrix cross-ratio 
Affine R? t:xh Ax+b, ratios along parallel 
where A isa2 x 2 lines 
invertible matrix 
Euclidean R? t:xr> Ux+a, length, angle Projective 


where U is a2 x 2 
orthogonal matrix 


We can think of the geometries in this table as forming a hierarchy of geome- 
tries with projective geometry at the top. Any property of a geometry in the 
hierarchy is also a property of the geometries further down the hierarchy. Also, 
theorems that are valid in one geometry in the hierarchy also hold in geometries 


further down. 
affine geometry 


more transformations 


fewer properties 


fewer congruence classes 


As we move up the hierarchy, the group of transformations becomes larger 
and so the congruence classes of figures merge together as more and more 
figures become congruent to each other. In Euclidean geometry, two triangles 
are congruent if and only if they have the same side lengths, and two conics 
are congruent if and only if they are of the same type (hyperbola, parabola or 
ellipse) and have the same shape and size. In affine geometry, any two triangles 
are congruent, and conics are congruent if and only if they are of the same type. 
In projective geometry, any two triangles are congruent and any two projective 
conics are congruent. 


8.2 _Projective Reflections 


The cross-ratio enables us to introduce a type of projective transformation that 
generalizes reflection in Euclidean geometry. 


Definitions Let F be a Point that does not lie on a Line ¢. Then a 
projective reflection in @ with centre F and non-zero parameter k is a 
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mapping 
r:RP* > RP? 
PR@Q 


where 


(@) OQ = JP, itt IP Ibe om 23 
(b) the cross-ratio (FRPQ) = k, where R is the Point of intersection of F P 
and £. 


Remark 


Provided that P does not lie on @, then by varying the parameter k we can 
arrange that the Point Q is any Point on the Line FP other than the Points F, R 
and P. 


We now indicate why this map is called a projective reflection. 

Let ¢’ be the representation of a Line @ in an embedding plane zr, and con- 
sider all the lines in that are perpendicular to ¢’. Since these lines are parallel, 
they must correspond to Lines in RP? which meet at an ideal Point, F say. Let 
r denote the projective reflection in € with centre F and parameter k = —1. 
Then, in the embedding plane z the effect of r is to map each (non-ideal) point 
P’ to its Euclidean reflection in 0’. 

r fixes the Points of @ so that its effect in z is to fix the points of ¢’. 

Next consider a Point P that does not lie on @, and let Q = r(P). Then PQ 
passes through F and meets @ at the Point R, where (FRPQ) = —1. For the 
embedding plane zr, F is an ideal Point, so that 

ol R’ 

P’ R’ 
(where R’, P’ and Q’ are the representations of R, P and Q in zr). The point R’ 
is therefore the midpoint of P’Q’. Moreover, since PQ passes through F, P’ Q’ 
must be perpendicular to €’, so Q’ is the Euclidean reflection of P’ in ¢’, as 
suggested. 

We now relate projective reflections to projective transformations. 


(FRPQ) = | 


Theorem 1 A projective reflection is a projective transformation. 


Proof Consider the projective reflection r in the Line @ with centre F and 
parameter k, and let Q = r(P) where P is a Point of RP* that does not belong 
to €U{F}. Then (FRPQ) = k, where R is the Point of intersection of PQ 
and £. 

By the Fundamental Theorem of Projective Geometry there is a projective 
transformation t which maps F to F’ = [0,0, 1] and which maps the Line £ to 
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the Line ¢’ with equation z = 0. (This can be achieved by mapping a pair of 
Points on @ to [1,0,0] and [0, 1, 0].) If P’ = t(P), O’ = t(Q) and R’ = t(R), 
then, since ft preserves cross-ratio, it follows that (F’R’P’Q’) = (FRPQ) = k. 
Now, if P’ = [a,b,c], then the Line through P’ and F’ has equation 
bx = ay, and so it meets the Line z = 0 at R’ = [a,b,0]. As Q’ also lies 
on the Line bx = ay, we can write Q’ = [a,b,d], for some d; here d # 0, 
since Q’ does not lie on £. 
Since 
(a,b,c) = 1- (a,b,0) + c- (0,0, 1) 
and 
(a,b,d) = 1- (a,b,0) +d- (0,0, 1), 
if follows that k = (F’R'P'Q’) = c/d, so that d = c/k. Hence the mapping 
s : RP? }& RP” which sends P’ to Q’ is the projective transformation defined 
by 
s([a,b,c]) = [a,b, c/k]. 
Also, s fixes the Points of £’ U {F’}. Thus the transformation r which maps 
P to Q must be equal to the composite t~! o s o t and so must be a projective 
transformation, as required. |_| 


Remarks 

1. If the parameter k = 1, then s and hence r = t os oft! are the identity 
map. 

2. If k ~ 1, then the only Points fixed by r are F and the Points of @; for, if 


s([a,b,c]) = [a,b,c/k] = [a,b,c], 


then either c = 0 (in which case [a, b,c] lies on ¢’) or a = b = 0 (in which 
case [a,b,c] = F’). 
3. If k = 0, then s, and hence r, is not a projective transformation. 


We showed earlier that every projective transformation can be expressed as 
a composite of perspectivity transformations. In fact, a similar result holds for 
projective reflections. 


Theorem 2 Every projective transformation can be expressed as a com- 
posite of a finite number of projective reflections. 


8.3 Hyperbolic Geometry and Projective Geometry 


First we introduce a new way of representing lines in hyperbolic geometry 
that will be useful in explaining how hyperbolic geometry is related to pro- 
jective geometry. In this representation lines appear straight, but angles are no 
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- 


Theorem 4, 
Subsection 3.3.4 


We omit a proof, since we 
shall not use this result. 
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longer represented accurately. The realization that this could be done was one 
of the crucial discoveries made by Klein around the time he was preparing his 
Erlangen Program. 

To obtain the new representation we use two mappings, stereographic pro- 
jection and orthogonal projection. Stereographic projection from the North 
Pole of S* maps the Southern hemisphere onto the interior of the unit disc J, 
and the equator to the boundary of Y. Its inverse therefore maps Y onto the 
Southern hemisphere. It is an angle-preserving map, so it sends an arc of a gen- 
eralized circle in Y, which is perpendicular to the equator, to an arc of a circle 
on the sphere perpendicular to the equator — such an image may be thought of 
as a circular arc ‘hanging vertically downwards’. 

Orthogonal projection maps each point (x, y,z) in R? along a line parallel 
to the z-axis to the point (x, y,0) in the (x, y)-plane. It maps the Southern 
hemisphere and the equator back onto Y. It maps a circular arc on the sphere 


perpendicular to the equator onto a segment of a straight line. 


Definition The linearization map L of the unit disc Y onto itself con- 
sists of the inverse of the stereographic projection map followed by the 
orthogonal projection map. 


Then L fixes every point of the boundary of the unit disc Z, and maps arcs of 
generalized circles perpendicular to the boundary of Y to segments of straight 


Three hyperbolic lines Their images under L 
in the Poincaré model 


The linearization map transforms the Poincaré model of hyperbolic geome- 
try into a new model of hyperbolic geometry in which d-lines are represented _ It is important to realize 


by straight line segments. We call this new model of hyperbolic geometry that although d-lines look 
straight in this model, 
angles are not represented 
accurately. 


the Beltrami—Klein model, after the Italian mathematician Eugenio Beltrami 
(1833-1900) who discovered it, and the German mathematician Felix Klein, 
who connected it to projective geometry. 

By considering the definition of L as the composite of the inverse of stereo- 
graphic projection followed by orthogonal projection, it is clear that the image 
of the d-line joining two points on the boundary of Z is the straight line seg- 
ment joining the points. To find the image of a point P of Y under L, consider 
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a d-line 


two d-lines through P and locate their end-points on the boundary of Y. Then 
L(P) is the point where the corresponding chords intersect. Notice that, if it 
helps, we can always assume that one of the d-lines is a diameter of J. 


image of d-line 
under L 


Example 1 Find the image under L of the point P(r, 0) in J. 


Solution Notice first that it is sufficient to consider the image under L of the 
x-axis and the d-line through P at right angles to the x-axis. 

The transformation L maps the x-axis to itself (as a line, not pointwise). 
The d-line through P perpendicular to the x-axis is part of the circle C with 
equation 

x? 4+ y? —2ax +1=0. (1) 


Since P(r,0) lies on C, it follows from equation (1) that r? —2ar +1=0,s0 
that a = anak 
The circle C with equation (1) meets the boundary of Z where 


x? + y?—2ax+1=0 and r+y=l. 


The chord of Y through the two points of intersection of C and the boundary 
of J is found by subtracting these two equations, and so it has equation 


2—2ax=0, or ax=l1. 


It follows that the image L(P) of P is the point (4. 0) = (25. 0). 


According to the Kleinian view of geometry, to define a geometry we must 
specify a space of points and a group of transformations acting on that space. 
Our new Beltrami—Klein model of hyperbolic geometry agrees with the (ear- 
lier) Poincaré model as far as the space of points is concerned: the space is the 
unit disc Y. The group of transformations is different, although isomorphic 
to the original group; the new group consists of all transformations of F to 
itself of the form LtL~!, where t is a hyperbolic transformation. Clearly, if t 
is a hyperbolic transformation mapping Y to itself and d-lines to d-lines, then 
LtL~' is a transformation mapping J to itself and chords to chords. 

To connect transformations of the form LtL~! with projective geometry, we 
shall show that every such transformation Lt L~! is a projective transformation 
which maps @ to itself and Y to itself. It is sufficient to prove this in the case 
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av 


The centre of C is on the 
X-axis, so there is no 
y-term in equation (1). 


€={(x,y): x*+y7=1} 
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that ¢ is a hyperbolic reflection; for, if f = t) o f2 o...0 ft, is a decomposition 
of ¢ into hyperbolic reflections then it follows that 


LtL“! =(LonoL"')o(LonoL!)o...0(LomoL) 


is a decomposition of LtL~! into projective transformations with the required 
properties; then it follows that LtL~! is a projective transformation with the 
required properties. 


Theorem 1 Let ¢ be the hyperbolic reflection in the d-line d joining the 
boundary points A and B of Y, and let F be the pole of the (Euclidean) 
line £ through A and B. Then LtL~! is the projective reflection r in @ with 
centre F and parameter —1. 


Proof First we show that LrL—! agrees with r on @. To do this, observe that 
lines through F are perpendicular to d, so the hyperbolic reflection r maps a 
given point C of @ to the other point D of @ that lies on FC. Since L maps @ 
to itself, it follows that LtL~!(C) = D. 

Now let E be the point of intersection of FC with €. Then, since F is the 
polar of & with respect to @, it follows that the cross-ratio (FECD) = —1. 
Hence 

HeE}=D Sia C), 


as required. 

Next we show that LtL—! agrees with r on Y. To do this, let P be an 
arbitrary point of Z. Then let AP meet @ at D, and let AC meet FP at Q. We 
will verify that r(P) = Q. 

We know that r(A) = A and that r(D) = C, so that r must map AD to 
AC. We also know that r maps the line FP to itself, so that it must map P (the 
point of intersection of AD and FP) to the point of intersection of AC and FP), 
namely the point Q. Thus r(P) = Q. 

Now we consider the transformation LtL~!. It certainly maps chords of 
to chords; so, to find LtL~! (P) we may take AD and FP as two Lines through 
P, and find their images. The images of AD and FP are AC and FP, so that the 
image of P is Q. That is, LtL~!(P) = Q. 

Thus LtL~! and r agree on both & and J. a 


We have thus shown that every hyperbolic transformation tf corresponds to 
a projective transformation LtL~! which fixes @ and maps its interior J to 
itself. In a similar way we can verify that every projective transformation that 
fixes @ and maps J to itself corresponds to a hyperbolic transformation. 

This shows that hyperbolic geometry can be considered as the sub-geometry 
of projective geometry in which the space of points is the set of points inside 
the fixed projective conic @, and its group of transformations corresponds to 
those projective transformations mapping @ to itself and the inside F of @ to 
itself. 
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By the result of Exercise 7 
on Subsection 4.3, in 
Subsection 4.6. 


Here we think of @ as 
being embedded in a 
suitable embedding plane. 


We omit the details of this. 


We defined the inside of a 
projective conic in 
Subsection 4.1.2. 


Hyperbolic Geometry and Projective Geometry 


In fact, since all non-degenerate conics are projectively equivalent, we can 
replace @ in the above discussion by any non-degenerate conic E. This may be 
achieved by replacing the linearization map L by the composite map soLos~!, 
where s is a projective transformation that maps @ onto E. 


Defining Hyperbolic Distance in the Projective Setting 


The fact that hyperbolic geometry may be considered a sub-geometry of pro- 
jective geometry enables us to give a new method for defining distance in 
hyperbolic geometry. Projective transformations preserve cross-ratio, which 
involves four Points; distance involves only two points. So we shall have a 
ready-made sense of distance in the Beltrami—Klein model of projective geom- 
etry as soon as we can connect two interior Points of a projective conic (for 
simplicity, we shall take @ as our projective conic) with a set of four collinear 
Points. This is easy — we join the two given Points P and Q by a Line that 
meets the projective conic in the Points A and B. 

We know that the cross-ratio (ABPQ) is unaltered by any projective trans- 
formation, and so in particular by the projective transformations that map the 
interior of the projective conic F to itself. 

We now investigate whether we can sensibly define the hyperbolic distance 
d from P to Q by the formula d(P, Q) = (ABPQ). The first thing we should 
want is that distance along a line is additive: that is, if R also lies on the d-line 
through P and Q, and R lies between P and Q, then is it true that 


d(P,Q) =d(P,R) + d(R, Q)? 


It follows from the definition of cross-ratio that 


AP [AQ 
d(P, Q) = (ABPQ) = 55 OB’ 
d(P,R) = (ABPR af ae 
(P,R) = ¢ ) = Sa op? 
and 
AR /AQ 
COG) = CBR) = op OB 


Hence, if we use the definition d(P,Q) = (ABPQ), then d(P,Q) 4 
d(P,R)+d(R, Q) but instead d(P, Q) = d(P, R) - d(R, Q). This suggests 
that in the Beltrami—Klein model we should instead define distance between 
arbitrary points P and Q of & by the formula 


d(P,Q) =klog,(ABPQ), for some number k, 


since then distance d is additive. (In fact, the arbitrariness in the choice of 
the constant k is analogous to the freedom we have to choose the radius of 
the sphere in spherical geometry. However, we choose the sign of k so that 
distances are non-negative.) 

To define a distance function d for the Poincaré model, let the conic E be the 
unit circle @ and define the distance d between two points P and Q in J to be 
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We omit the details. 


Historically, it was this 
discovery that excited 
Klein. 


> 


In this discussion we shall 
not bother to distinguish 
between Points and the 
corresponding points in a 
suitable embedding plane, 
for simplicity. 


Recall that log, (xy) = 
log, x + log, y, for any 
positive numbers x and y. 
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the distance d’ between the corresponding points P’ = L(P) and Q’ = L(Q) 
in the Beltrami—Klein model. That is, we define 


d(P,Q) =d'(P’,Q’). 
To verify that this definition agrees with that given when we introduced hyper- 
bolic geometry earlier, let P be the origin and let Q be an arbitrary point (7, 0) 
on the positive x-axis. Then, by Example 1, P’ = L(P) is also at the origin 
and Q' = L(Q) = (s,0), where s = = 


Fs 
r2 


points A(—1,0) and B(1, 0), it follows that 
d(P,Q) =d'(P’,Q’) 
= k log, (ABP'Q’) 


1 l+s 
mee Gee) 

1-= ar 
— k log, (=) 

I+ ee, 


kl 1—2r+r2 
=klo ——_—_____ 
Se 1+2r+r2 


~ tia. (4) 


1 
= —2k log, (; **). 
=/ 


By choosing k = —} and representing P and Q by the complex numbers 0 
and z = r + Oi, respectively, we obtain 


1 1 
d(0,z) = 5 tog, ( =e): 


Since the x-axis meets @ at the 


1—|g| 
This is the logarithm formula for hyperbolic distance in Y that you met earlier. 


8.4 __ Elliptic Geometry: the Spherical Model 


To bring (the spherical model of) elliptic geometry into the Kleinian fold, we 
need to connect its space S? to the space RP” of projective points and its group 
to the group of projective transformations. 

With each point of S? we can associate a line through the origin, the line that 
joins the given point to the origin; this map assigns the same line to a pair of 
antipodal points. Conversely, given a projective point we can associate it with 
the pair of diametrically opposite points that it cuts out on S*. In this way, we 
see that there is a 2-1 map of S* to RP” that sends each pair of antipodal points 
on S* to the line through the origin (the projective point) that they define. 

Since a spherical transformation sends antipodal points to antipodal points, 
it maps projective points to projective points. Also, every spherical transfor- 
mation maps great circles to great circles, and so maps projective lines to 
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Notice that we use d’here, 
temporarily, to avoid 
confusion. 


Subsection 6.3.1 


Here we use that 


log, (<) = — log, (x) 
x 


and 


log, (x?) = 2 log, (x). 


Subsection 6.3.1 


Elliptic Geometry: the Spherical Model 


projective lines. So it is clear that spherical transformations are projective 
transformations, and the group of spherical transformations is a subgroup of 
the projective group. 


Projective Geometry and Spherical Geometry 


Spherical geometry is a subgeometry of projective geometry which preserves 
the Euclidean angle between two lines through the origin in R*; the exact 
description of the subgroup is quite subtle, however, because the map from 
points of S? to Points in RP? is 2-1. 

One method of representing a projective point by a single point on S? is 
to keep exactly half of S*, in some sensible way. For example, let E” be the 
subset of S* defined by 


E* = northern hemisphere, {(x, y,z) : z > O} 


U half the equator, {(x, y,z) : z= 0,x > O} U{B(O, 1,0)}. 


Then every line through the origin meets E? in exactly one point, and every 
projective line meets E? in half a great circle. 

The 2-1 map from S? to RP? therefore gives a 1-1 map from S* to E?: 
every point of RP? corresponds to a unique point of E*, and every point of 
E? defines a unique Point of RP?. So E? provides ‘a complete picture’ of 
projective space RP”. 

To see what’s odd about E”, consider the Points corresponding to the points 
(sin 8,0, cos @) on S? as they trace out the Line y = 0 in RP. Each such Point 
gives rise to two points on S?: 


Pr= (siné,0,cos@) and P= (—sin6,0,—cos@). 


As 6 goes from 0 to 7/2, the point P+ lies in E?. But when 6 > 77/2, the 
point P~ lies in E7, rather than the point P+, and this continues as 6 goes 
from 2/2 to 37/2 (including 6 = 37/2). Then as 6 goes from 3/2 to 27, it 
is again the point P+ that lies in E?. 

So in E? the image of the Line is one-half of a great circle, described twice. 
There is an awkward switch in E* from the Euclidean point corresponding to 
the projective point being Pt to being P~. We cope with this by imagining the 
pairs of antipodal points to be ‘joined up’; that is, we agree to regard antipodal 
points in the equator of S* as ‘being the same point’. 

Conics appear equally odd. For example, let E denote a projective conic 
made up of Euclidean lines through the origin of R? that form a right circular 
(Euclidean) cone in R?. This cone meets S? in two little circles. 

Suppose first that the points P+ all lie in E7, so that the cone does not meet 
the equatorial plane in S. Then the points in E* that correspond to E forma 
little circle on S?. 

Suppose next that the cone cuts the equatorial plane of S?. Then we must 
switch from P* to P~ as the Point describes the Line, and the points in E 2 
that correspond to E form two curves (parts of little circles) on S*. 
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The description lies 
beyond the scope of this 
book. 


Here we give an informal 
description of the 
representation. 


Here we discuss the points 
of S? that correspond to 
the complete Line y = 0. 


E2 


image of Line 
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image of projective conic 


image of projective conic 


projective conic 


projective conic 


To see what’s really odd about this, ‘thicken up’ the Line until it is a strip 
bounded by two little circles in S?. The boundaries of the strip can be taken 
to be two projective conics, one on either side of the Line. Since the image of 
the Line meets the equator in S*, we know that the images of the (associated) 
projective conics also meet the equator of S?. 

Now trace the image of the Line on S?. As you trace the Line from N to N’, 
the curve PP’ (which is part of a projective conic PP’ P”) is to your right, and 
the curve QQ’ (which is part of a projective conic QQ’ Q”) is to your left. 

When you reach the equator, the equatorial points N’ and N” are identified, 
and you can complete your journey round $7 to return to N. But the point P’ 
is identified with Q”, so that the moving point that started from P returns to 
Q; similarly, the point Q’ is identified with P’’, so that the moving point that 
started from Q returns to P. 

The curve P” P P’ that was on your right is now on your left, and the curve 
Q” QQ’ that was on your left is now on your right. The image of the strip is 
a strip in which the ends have been joined after receiving a half-twist. The 
image is a Mébius band, which illustrates a significant mathematical point: Modbius bands are beloved 
2-dimensional real projective space RP is not orientable — that is, it cannot of conjurers! 
be given an everywhere-consistent sense of left and right. That is, of orientation. 


Inversive Geometry and Spherical Geometry 


In inversive geometry, we study transformations which map generalized cir- 
cles to generalized circles and preserve the magnitude of angles. Every 
transformation in inversive geometry is composed of a sequence of inversions. 
In spherical geometry, we study transformations which map great circles to 
great circles and preserve the magnitude of angles. Every transformation in 
spherical geometry is composed of a sequence of reflections in great circles, 
which are therefore inversions. It follows that spherical geometry is a subge- We omit the full details. 
ometry of inversive geometry, in exact analogy with the way that Euclidean 
geometry is a subgeometry of affine geometry. 


8.5 Euclidean Geometry 


Spherical geometry appears easier to understand than hyperbolic geometry 
because we can produce spheres S* as a model of the geometry, but there 


Euclidean Geometry 


seems to be no surface in Euclidean 3-dimensional space which is a good 
model of hyperbolic geometry. One may exhibit ‘pieces’ of hyperbolic geom- 
etry, but it was shown by David Hilbert around 1900 that no surface embedded 
in 3-dimensional space is a complete model of hyperbolic 2-dimensional space. 

But it can be argued that space might be hyperbolic (indeed, some would 
say that, according to Einstein’s theory of General Relativity, it is more nearly 
hyperbolic than Euclidean). This raises the question: Is Euclidean geometry a 
subgeometry of hyperbolic geometry? We sketch a discussion of the answer to 
this interesting question. 

Remarkably, the answer to the question is YES: Euclidean geometry is a 
subgeometry of hyperbolic geometry 

To see that this is so, we need first to ‘go up a dimension’. Hyperbolic 
3-dimensional space can be modelled on the interior of the unit ball: a hyper- 
bolic point is a point (x, y, z) such that xe yn +2 < 1, and a d-line is an arc 
of a generalized circle perpendicular to S*. All the other definitions in hyper- 
bolic geometry generalize in the obvious way, including that of inversion and 
hyperbolic reflection. In particular, the transformations of hyperbolic geometry 
correspond to composites of inversions in spheres perpendicular to S*. 

In the unit ball horospheres are spheres that touch S? at a single point. For 
simplicity, we now fix our attention on the horosphere Hy that touches S? at 
N, and restrict our attention to the subgroup Gy, say, of hyperbolic trans- 
formations that map Hy to itself. Then we have the following remarkable 
result. 


Theorem 1 The geometry on Hy is Euclidean geometry, and the group 
Gy acts on Hy as Euclidean isometries. 


Proof Invert the whole figure in a sphere with centre N. The point N is 
: Z = a, say, and Hy maps to 
a plane parallel to z, the plane z’ : z = b, say. An element of Gy is an inver- 
sion in a generalized sphere through N and perpendicular to Hy; following 
composition with the first inversion, the element of Gj becomes an inver- 
sion in an unbounded generalized sphere perpendicular to the plane z’. These 
generalized spheres must in fact be planes perpendicular to the plane zr’. 
Now, inversion in a plane is Euclidean reflection in that plane. So the 
subgroup mapping the image of Hy to itself is the group generated by all 
Euclidean reflections, which is the group of Euclidean isometries. It follows 
that the geometry on the plane z’ is Euclidean geometry. The desired result 
then follows by applying the inverse of the original inversion. | 


mapped to ‘infinity’; S* maps to a plane z 


If indeed we had grown up believing that we lived in a hyperbolic 
3-dimensional space, we should therefore be very familiar with spheres and 
horospheres! 
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David Hilbert 
(1862-1943), a professor 
at Gottingen University, 
was a mathematical 
universalist, influential in 
the foundations of 
mathematics; he is best 
known for his 1900 ‘23 
Problems’ collection that 
profoundly influenced the 
course of 20th-century 
mathematics. 


It lies beyond our scope to 
deal with the matter fully. 


S2 = {Gey.z)i 
xrpyr+r7= 1}. 
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Summary of Chapter 8 


Section 8.1: Affine Geometry 


1. The group of affine transformations can be identified with the subgroup of 
projective transformations that map RPS onto itself. 

2. Euclidean geometry is a subgeometry of affine geometry, which is itself 
a subgeometry of projective geometry; the three form a hierarchy. The 
higher up the geometry in the hierarchy, it has more transformations, fewer 
properties and fewer congruence classes. 


Section 8.2: Projective Reflections 


1. Let F be a Point that does not lie on a Line @. Then a projective reflection 
r in with centre F and non-zero parameter k is a mapping of RP” to itself, 
with P mapping to Q, where (1) Q = P if P lies on @ and (ii) the cross-ratio 
(FRPQ) = k, where R is the Point of intersection of FP and @. 

2. A projective reflection is a projective transformation. Every projective 
transformation can be expressed as a composite of a finite number of 
projective reflections. 


Section 8.3: Hyperbolic Geometry and Projective Geometry 


1. Orthogonal projection maps (x,y,z) € R°* to (x, y,0) € R? or equiva- 
lently, to (x, y) € R2. 

2. The linearization map L of the unit disc Y onto itself consists of the 
inverse of the stereographic projection map followed by the orthogonal 
projection map. 

3. In the Beltrami—Klein model of hyperbolic geometry the space is Y and 
the d-lines are straight-line segments. 

4. Let t be the hyperbolic reflection in the d-line d joining the boundary points 
A and B of Y, and let F be the pole of the (Euclidean) line @ through A 
and B. Then LtL~! is the projective reflection r in ¢ with centre F and 
parameter —1. 

5. Every projective transformation that fixes @ and maps Y to itself corre- 
sponds to a hyperbolic transformation. 

Hyperbolic geometry can be considered as a subgeometry of pro- 
jective geometry, where the points are points inside a fixed projective 
conic @ and the group of transformations corresponds to those projective 
transformations mapping @ onto itself and the inside of @ to itself. 

6. In the Beltrami—Klein model, we can define the distance between two 
points P and Q of Y (where PQ meets @ at A and B) by d(P,Q) = 
k log, (ABPQ), where k is chosen so that distances are non-negative. This 
is equivalent to the definition of distance in the Poincaré model. 


Euclidean Geometry 


Section 8.4: Elliptic Geometry: the Spherical Model 


Ll. 


3. 


There is a 2-1 map of S* to RP? that sends each pair of antipodal points 
on S? to the line through the origin that they define. There is a 1-1 map of 
E? = {(x, y,z):z > O}U{(x, y,z) :z =0,x > 0} U {(0, 1, 0)} to RP?. 


. Spherical transformations are projective transformations; the group of 


spherical transformations is a subgroup of the projective group. 
Spherical geometry is a subgeometry of inversive geometry. 


Section 8.4: Euclidean Geometry 


iF 
2: 


Euclidean geometry is a subgeometry of hyperbolic geometry 
A point in 3-dimensional hyperbolic space is a point (x, y, z) with x7 + y?+ 
2? < 1, and ad-line is an arc of a generalized circle perpendicular to S?. 


. A horosphere is a sphere in R? that touches S? at a single point. 


Let Hy be the horosphere that touches S$? at N, and Gy the subgroup 
of hyperbolic transformations that map Hy to itself. Then the geometry on 
Hy is Euclidean geometry, and the group Gy acts on Hy as Euclidean 
isometries. 
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Special Symbols 


{a,b, ..., t} 
{x : x satisfies some condition} 


(a,b) 
[a, b] 
[a, b) 
] 


Sl cna) hm oS 
SF 
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set consisting of the elements a, b,..., t 
set consisting of the elements x that satisfy 
some condition 

{x:a<x <b} 

{x:a<x<b} 

{x:a<x <b} 

{x:a<x <b} 

composition of functions (from right to left) 
belongs to 

does not belong to 

is a subset of 

is not a subset of 

equals or is a subset of 

set-theoretic union 

set-theoretic intersection 

empty set 

is equivalent to, is congruent to 

maps to 

the function f maps the set A to the set B, 
with the element x mapping to the element y 
determinant of a matrix A 

adjoint of a matrix A 

transpose of a matrix A 

inverse of a matrix A 

dot or scalar product of vectors x and y 
length of a vector v 

inverse of the transformation t 

set of real numbers 

set of positive real numbers 
n-dimensional Euclidean space 

set of complex numbers 

set of natural numbers 
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A triangle 
>>) sum 
Z angle 
A(2) set of all affine transformations of R2 
RP? real projective space of dimension 2 
P(2) group of projective transformations of RP? 
[x] homogeneous coordinates of a Point in RP? 
(ABCD) cross-ratio of the ordered set of points A, B, C, D 
S, Si, Sij Joachimsthal’s notation for conics 
Q unit disc, {z:z€C: |z| < l}or 
{(x, y) ‘ix 9) eR, 2? y? = 1} 
C unit circle, {z: z€C: |z| = l} or 
{(x,y): @y) eR’, x? + y? = 1} 
Rez real part of a complex number z 
Imz imaginary part of a complex number z 
Zz conjugate of a complex number z 
[z| modulus of a real or complex number z 
Cc extended complex plane C U {oo} 
A-B set of points in A that are not in B 
S Riemann sphere 
Gg group of hyperbolic transformations 
d(Z1, Z2) hyperbolic distance between z, and z2 
H upper half-plane, {z : z € C, Imz > 0} 
o unit sphere in R?, {@, y,z) (x,y,z) ER, x27 +y2?4+2= 1} 
S(2) group of spherical isometries 
R(X, a) rotation about the x-axis in R3, through an angle a 
R(Y, p) rotation about the y-axis in R°, through an angle 


R(Z,y) rotation about the z-axis in R?, through an angle y 
m(X,Y,Z)  stereographic projection of (X,Y, Z) 


A family of Apollonian circles 

RP% Points in RP? that pierce the standard embedding plane 
z=1, {[x, y,z] € RP* :z 40} 

Po(2) group of projective transformations mapping the Line z = 0 
to itself 

L linearization map of unit disc Y onto itself 

E2 union of {(x, y,z) : z > O} and {(x, y,z):z=0, x > O} U 
{(0, 1, 0)} 

Hy horosphere that touches S* at N 


Gn group of hyperbolic transformations that map Hy to itself 
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Appendix 1: A Primer of Group Theory 


A binary operation o defined on a set G is a map from G x G to G; that is, if 
gi and go are elements of G then g; 0 g2 € G. We call the binary operation o 
composition and g o g2 the product of g; and go, and we say that the binary 
operation is closed. An isometry of a figure F in R? or R? is a mapping of F 
to R? or R? that preserves distances; it is a direct isometry if it can be carried 
out by a ‘continuous’ transformation within the space concerned, and indirect 
otherwise. 


Definition Let G be a set and o a binary operation defined on G. Then 
(G, o), or G for short, is a group if the following four axioms hold. 


Gl CLOSURE For all 91, g2 € G, g1092 €G. 
G2 IDENTITY There exists an identity element e € G such that for 
allg €G, 


LOGS 8 =CoR. 


G3  INVERSES For each g € G, there exists an inverse element 
g~! € G such that 


= 1 
Sly Sey ~ ke 
G4 ASSOCIATIVITY For all g1, g2, 93 € G, 


81 © (92 0 83) = (1 0 82) 0 83. 


In any group, the identity element is unique and each element has a unique 
inverse. If g belongs to a group, then (g~!)~! = g. 
Examples of groups include: 


The integers Z with the binary operation of addition, +; 

The real numbers R with the binary operation of addition, +; 

The positive real numbers R* with the binary operation of multiplication, x; 
The set of all translations of the plane R?; 

The set of all rotations and translations of the plane and their composites; 
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e The set of all composites of all reflections of the plane (this is the isometry 
group of the plane); 
e The set of all 2 x 2 matrices with non-zero determinant. 


If the binary operation o of a group G is commutative (that is, g} o g2 = 
g2 0 gy for all g1, g2 € G), we say that the group is commutative or Abelian. 

A subset H of a group G is a subgroup of G if it is a group with respect to 
the same binary operation. This can be reformulated as follows: 


Definition Let (G, 0) be a group with identity e, and let H be a subset of 
G. Then (H, 0) is a subgroup of (G, o) if the following axioms hold: 


SG1 CLOSURE  Forall hy; and h2 € H, the composite h; oh2 € H. 
SG2 IDENTITY The identity element e € H. 
SG3 INVERSES For eachh € H, the inverse element h—! € H. 


The order of an element g of a group G is the least positive integer n 
such that g” =e. If there is no such integer, we say that g has infinite order. 
A cyclic group G is one in which every element is of the form g” for some 
fixed element g € G; g is called a generator of G. 

The order of a group is the number of elements in it. If this number is infi- 
nite, we say that the group is of infinite order. For a group G of finite order, 
Lagrange’s Theorem asserts that the order of any subgroup of G divides the 
order of G. 

Two groups that have the same algebraic structure are called isomorphic. 


Definition Two groups (G,o) and (H,«) are isomorphic if there exists a 
mapping ¢ : G — A such that the following properties hold: 


1. ¢ is one-one and onto; 
2. for all g1, g2 € G, b(g1 © 82) = $(g1) * $(g2). 


Such a mapping ¢ is called an isomorphism. 


Two elements x, y of a group (G,o) are conjugates of each other in G if 
there exists an element g € G such that y= go x0 g~! (which we often write 
simply as y = gxg~!). We say that g conjugates x to y, and that g is a 
conjugating element. 

Let H be a subgroup of a group (G,o), and let g € G; then the set gH = 
{goh:he H} isa left coset of H in G. It is a subset of G, but in general not 
a subgroup of G. Two left cosets g;H and g2H are either equal or have no 
elements in common. A right coset is defined similarly as a set of the form 
Ag ={hog:he H}. 

For any given group G and subgroup H and any two elements gy and go of 
G, there is usually no particular relation between a left coset g; H and a right 
coset Hg; a right coset need not be a left coset, nor a left coset a right coset. 
If however every right coset is a left coset, and every left coset a right coset, 
then the subgroup H of G is called a normal subgroup of G. 
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Definition A subgroup H is a normal subgroup of a group G if, for each 
element g € G, gH = Hg. 


For example, the group of all translations of the plane is a normal subgroup 
of the group I’ of all isometries of the plane; but the group of all rotations 
about a fixed point P in the plane, although a subgroup of I’, is not a normal 
subgroup of I’. 

If H is a normal subgroup of G, we can talk about the cosets of H in G 
rather than the left cosets and the right cosets. In this case, the cosets them- 
selves form a group under the binary operation -, called set composition, 
defined for subsets A, B of G as follows: 


A-B={aob:a€A,beE B}. 


(In particular, gj H - goH = (g1 © g2)H, for all gj, go in G.) This group is 
called the quotient group of G by H, and is denoted by G/H. The identity 
element of G/H is the coset eH = H, and the inverse of gH is g~'H. 

Every subgroup of an Abelian group G is a normal subgroup of G. 


Note that this does not 
require that gh = hg for 
each h. 


Appendix 2: A Primer of Vectors 
and Vector Spaces 


Vectors 


A vector is a quantity that is determined by its magnitude and direction. 
A scalar is a quantity that is determined by its magnitude. A vector in R? 
or in R* can be represented by a line whose length is a measure of the 
magnitude (or length) of the vector and whose direction is the same as that 
of the vector, A vector represented by a line segment from A to B is often 
written as AB. 

The vector space R? is the set of ordered pairs of real numbers with the 
operations of addition and scalar multiplication defined as follows: 


(x1, v1) + (x2, y2) = Cr + x2, v1 + 2); 
a(x,y) = (ax,ay), for any real number a. 
In R? the vectors i and j are unit vectors in the positive direction of the 
x- and y-axes, respectively. Any vector v in R? can be expressed as a sum of 
the form 


v=.xi-+ yj, for some real numbers x and y; 


often we simply write v = (x, y), for brevity. The numbers x, y are the com- 
ponents of v in the x- and y-directions. Similarly, in R? the vectors i, j and k 
are unit vectors in the positive direction of the x-, y- and z-axes, respectively. 
Any vector v in R? can be expressed as a sum of the form v = xi + yj + zk, 
for some real numbers x, y and z. 

The vector space R? is defined similarly. 

The dot (or scalar) product of two vectors u and v is 


u-v= ||lul| x ||v|| x cosé, 


where ||u|| and ||v|| denote the lengths of the vectors u and v and @ is the 
angle between them. It follows that the angle 6 between two vectors u and v 
is given by 
cos 0 = a 

full x {IVI 
Ifu = xji+ yj and v = x2i+ yoj, then u- v= x,x2 4+ yi y2. 

Two non-zero vectors u and v are orthogonal if they are at right angles; then 
u-v=0. 
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The length of a vector v is given in terms of the dot product as ||v|| =./V- v. 


Position Vectors 


The position vector v = xi-+ yj is the vector in R? whose starting point is the 
origin and whose finishing point is the point with Cartesian coordinates (x, y). 
A position vector in R? is defined similarly. 

Let R be a point on the line which passes through two points P and Q in R?, 
and suppose that p, q and r are the position vectors of P, Q and R, respectively. 
Then there is a real number A such that 


r=Ap+(1—A)q. 


The coefficients of p and q in this expression can be used to calculate the 
ratio in which the point R divides the line segment PQ; namely, 


PR 1-:’ 
RQ 2d. 

This ratio is actually a ‘signed’ ratio. If 0 < 2 < 1, then the ratio is positive 
and R lies between P and Q. But if A > 1 or if A < 0, then the ratio is negative 
and R lies beyond P or beyond Q, respectively. - 

This negative ratio can be interpreted as meaning that the vectors PR and 
RQ point in opposite directions. 

Another way in which we can calculate ratios is to use the Cartesian coordi- 
nates of the points concerned. Indeed, provided that the line through P, Q and 
R is not parallel to the y-axis, then it is clear from the figure in the margin that 

PR XR —Xp 

RQ XQ —XR’ 
where xp,xg and xr are the x-coordinates of the points P,Q and R, 
respectively. 

Similarly, if the line through P, Q and R is not parallel to the x-axis, then 

PR _ yR—-ypP 

RQ yo~yr’ 
where yp,yg and yr are the y-coordinates of the points P,Q and R, 
respectively. 

Often the line through P, Q and R will be parallel to neither of the axes, in 
which case both formulas will work. In such cases we can pick the formula that 
makes the calculations simplest. One advantage of the coordinate approach is 
that we do not have to worry about whether the ratio is positive or negative, for 
the coordinates take care of the sign automatically. 


Section Formula The position vector r of the point R that divides the line 
segment ¢ joining the points P and Q with position vectors p and q in the 
ratio (1—A):A 1s 


r=Ap+(1—A)q. 


ratio positive 


Xp 
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It follows that the vector form of the equation of a line through the points 
with position vectors p and q is 


r=Ap+(1—A)q,_ where A varies over all real numbers. 


Vector Spaces 


Definitions A real vector space consists of a set V of elements (called 
vectors) and a binary operation + defined on V such that (V,+) is an 
Abelian group, and such that for each v € V and each real number 
a,av € V. Moreover, 


lv =v, 


a(Bv) = (aB)v, 


and the two distributive laws hold: 
a(v+w) =av+aw, 


(a+ B)v = av- fv. 


Examples of real vector spaces include: 


the set of real numbers with the operations of (ordinary) addition and of 
multiplication by real numbers; 

the set of ordered pairs of real numbers, R? = {(x, y) : x,y € R}, where 
(x1, y1) + (X2, y2) = (41 + X2, 1 + y2) and a(x, y) = (ax, ay); 

the set of ordered triples of real numbers, R= {(x,y,z) : x,y,z € R}, 
where (x1, y1,Z1) + (%2, y2, 22) = (41 +2, yi + y2, 21 +22) anda(x, y,z) = 
(ax,@y,@Z). 


A set E = {e1,€2,...,€n,..-} of vectors is linearly independent if the 
only solution of the equation aje; + a2e2 +... + anen +... = Visa, = 
a2 = ... = O; if it has another solution, then E is said to be linearly 
dependent. 

Given any set E = {e1,€2,...,€n,...} of vectors in a vector space V, the 
set of all possible vectors of the form {aje; + a2e2 +... + Qn€n +... : 
01,02,...,Qy,... real} is called the span of EF, denoted by (F). If E is any 
subset of V, then (£) is a vector space too, called a vector subspace of V. 

A basis of a vector space V is a subset E of V such that E is a linearly 
independent set and E spans V. Then each vector v in V can be expressed 
uniquely in the form 


v=aqye; t+a2eg+...+ane,+..., Where a,Q2,...,Q@,... are real. 


All bases of a given vector space V have the same number of elements. If 
this number is finite, n say, then V is said to be finite-dimensional or to 
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have dimension n; if this number is infinite, then V is said to be infinite- 
dimensional. 
The standard basis of IR” is the set of n vectors 


{(1,0,0,...,0), (0, 1,0,...,0),...,(0,0,..., D}. 


A basis E of a vector space V is an orthogonal basis if each pair of vectors 
in E are at right angles to each other; it is an orthonormal basis if, in addition, 
the length of each vector in F is 1. 


Linear Transformations 


Definition A linear transformation f is a mapping of one vector space V 
into another vector space W such that 


(a) t+ v) = ¢(u) + £(v), for all u, veV; and 


(b) t(av) = at(v), for all veV, and all real a. 


Lett: V — W bea linear transformation of a (domain) vector space V of 
dimension n and with a basis E = {e1,€2,...,€,} into a (codomain) vector 
space W of dimension m and with a basis F = {f1,f2,...,f,}. Then, if 


t(ey) = ayyf} + aoif2 +... + anil, 


t(e2) = ayof} + ao2f2 +... + Gnafm, 


t(€n) = inf) + donf2 +... + danfm, 


the rectangular array or matrix denoted by 


Qi1 @j2_ «.. Gin 

421 422... 42n 
A= (aij) = 

Gm1 Gm2 +--+ Gmn 


with m rows and n columns represents the linear transformation f; we also say 
that ¢ has matrix A. The matrix A is said to be of size or dimension m x n. A 
1-row matrix is called a row matrix, and a 1-column matrix is called a column 
matrix. An m x n matrix is square if m = n; that is, if it has the same number 
of rows as columns. A square matrix A = (a;;) is diagonal if the entries not 
on the principal diagonal from top left to bottom right of the array are zero; 
that is, aj; = 0 fori # j. The identity matrix I, (or I, for short) is the n x n 
diagonal matrix whose entries on the principal diagonal are all 1s. 

An anticlockwise rotation of IR? through an angle @ is represented by the 


matrix 
cos@ —sind 
sin 6 cos@ }’ 
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a dilatation of R? by a factor k by 


(ot): 


and a reflection of R? in the line y = kx by 


1—k 2k 
1+k2 14k 
2k 1-—k 
1+k 14k 


Two matrices A and B are equal if they are of the same size, and corre- 
sponding entries are equal. The sum, A + B, of matrices A and B is defined 
only if A and B are of the same size; the entries of A + B are obtained by 
adding corresponding entries of A and B. For any real number @ and matrix A, 
the matrix wA is a matrix of the same size as A but whose entries are a times 
the corresponding entries of A. The difference A — B of matrices A and B is 
simply A + (—1)B. 

The transpose A’ of a matrix A of size m x n is the matrix of size n x m 
obtained by interchanging the ith row for the ith column; that is, if A = (a;;) 
then A? = (aji). 

The product AB of matrices A and B of sizes m xn and k x / is only defined 
if n = k; in this case, AB is the m x / matrix whose entry in the ith row and 
jth column is the dot product of the ith row of A and the jth column of B. 

If E = {e1,e2,...,e,} is a basis for V and F = {f|,f2,...,f,,} is a basis 
for W, and if v = aje; + a2€2 +...+a,e, € V, then we can express v (in 
terms of its components with respect to E) in the matrix form 


An 


and the image of v under f (in terms of its components with respect to F) in 
the matrix form 


aii a\2 tee ain eat 

a2 a22 aioe a2 a2 
t(v) = AV= 7 . 

Ami Gm2 +--+ Amn Qn 


a1] + 1202 +... + AjnQn 
a21Q1 + 42202 +... + A2nQn 


Am1Q1 + An202 +... + AmnAn 
so that 
t(v) = (ayy + ay202 +... + ayn) fy) + (az1a) + an202 +... 


+ d2n On £2 +... + (m1) + An202 +... + Amn @n)fn.- 
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Let s be a linear transformation mapping a vector space V (of dimension n 
and with a basis E) into a vector space W (of dimension m and with a basis F’) 
with matrix A; let ¢ be a linear transformation mapping the vector space W 
(with the same basis F’) into a vector space K (of dimension k and with a 
basis G) with matrix B. Then the composite mapping ¢ o s is a linear trans- 
formation with matrix BA. In general, AB 4 BA even when both products are 
defined. If the product AB is defined, then (AB)? = B’ A’. 

A linear transformation s of a vector space V into a vector space W is 
invertible if there is a linear transformation ¢ of W into V such that 


sot =tos = the identity mapping of V to itself. 


If V and W are of dimensions n and m, then s is invertible only if m = n; 
if s is invertible and has matrix A, then the matrix of f is denoted by AT, the 
(matrix) inverse of A. If A is invertible, then AA~! = A~'A = 1. 

If A and B are invertible n x n matrices, then AB is invertible and (AB)~! = 
B—!'A—!. A square matrix A is orthogonal if A~! = A’, or, equivalently, if 
AAT =TorATA=L. 


Determinants 


Associated with any square matrix A is a number called the determinant of A, 
denoted by det A, det(A) or |A|. 


b 
If Aisal x 1 matrix (a), then det A=a.If Aisa2 x 2 matrix (: A; 
Cc 


then 
a b 
det A = = ad — be. 
a 
If A is a3 x 3 matrix 
abe 
de fl, 
gh i 
then 
abe 
detA=|de f 
gh i 
xj) © f|-s| ¢ f a d | 
hi g i h 


= aei —afh — bdi+bfg-+4+cdh — ceg. 


In each term of the second last line above, the coefficient is an entry in the 
top row of the determinant, with alternate + and — signs inserted, and the 
associated 2 x 2 determinant is obtained by deleting the row and column of A 
that contain that entry. (n x n determinants may be defined analogously.) 


Appendix 2: A Primer of Vectors and Vector Spaces 
The determinant has the following properties: 
det I = 1; det A’ = dertA; 
det(AB) = detA-detB; detA~! = 1/(detA). 


A square matrix A is invertible if and only if det A #4 0. If Ais a2 x 2 


matrix , then 
c d 


There are several ways of finding the inverse of a 3 x 3 matrix; we give only 
one here! The adjoint matrix, adj A, associated with the 3 x 3 matrix 


abe 
A=|de f]., 

gh i 

is the 3 x 3 matrix 
e f| ld f d e|\° 
h i g i g h 
b 
adj A= | — 7 oe | ale 2 

hi g i g h 
bc _|@ie a b 
é ff d f de 


obtained by replacing each entry in A by the 2 x 2 determinant (called the 
cofactor of that entry) obtained by deleting the row and column of A that 
contain that entry, together with alternate + and — signs as indicated, and then 
taking the transpose. Then 

= 1 


= dj A). 
wa! ) 


The equation of the line through two points A = (a1,a2), B = (b1,b2) in 
the plane R? is 


ay by x ay a 1 
a2 b2 y =0 or bj bo 1 = 0; 
1 1 1 x y 1 


and three points A = (aj,a2), B = (bi, b2) and C = (cj, c2) in the plane are 
collinear if and only if 


ay by Cl a, a2 1 
a2 bo (S39) =0 or by bo 1 =0) 
kL oof ff cy. c@ | 
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Eigenvectors and Eigenvalues 


An eigenvector of a linear transformation ¢ of R? (or R?) to itself is a non-zero 
vector v such that t(v) = Av for some real number A called the corresponding 
eigenvalue. The eigenspace (A) is the vector subspace of R? (or R*) spanned 
by all eigenvectors of t with 4 as the corresponding eigenvalue; an eigenspace 
may have dimension greater than 1. 

If a linear transformation of R2(or R?) to itself has matrix A, then a non-zero 
vector v such that Av = Av for some real number A is called an eigenvector of 
A; A is called the corresponding eigenvalue. 

The eigenvalues 2 of ann x n matrix A satisfy the equation det (A — AID = 
0, called the characteristic equation of A. A 3 x 3 matrix necessarily has 
at least one real eigenvalue. The trace of A, denoted by tr A, is the sum of 
the elements along the principal diagonal of A, and equals the sum of the 
eigenvalues of A. 

To find the eigenvalues and eigenvectors of a matrix A, first solve the char- 
acteristic equation to determine the eigenvalues ; then solve the simultaneous 
eigenvector equations (A — AI)v = 0 to find the corresponding eigenvectors 
and eigenspaces. 

Ann x n matrix A is diagonalizable if there is an n x n diagonal matrix 
D such that D = P~!AP for some invertible n x n matrix P; the entries on 
the principal diagonal of D are the eigenvalues of A. Ann x n matrix A is 
diagonalizable if and only if it possesses n linearly independent eigenvectors. 

A (square) matrix A is symmetric if A’ = A; ann x n symmetric matrix 
has n real eigenvalues. A symmetric n x n matrix A is orthogonally diago- 
nalizable; that is, there is an n x n diagonal matrix D such that D = P-'AP 
for some orthogonal n x n matrix P. The columns of P are eigenvectors of A 
of length 1, and are mutually orthogonal. 
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Chapter 1 
Section 1.1 


1. 


Here we use the standard formula for the equa- 
tion of a circle of given centre and radius given 
in Theorem 1. 


(a) This circle has equation 
(@@-0) +Q-OP =P, 
which can be rewritten in the form 
x74 y? = 1 
(b) This circle has equation 
(«0° +(y—-0" = 4, 
which can be rewritten in the form 
Ro y? = 16. 
(c) This circle has equation 
(@—3) + —- 4" = 2’, 
which can be rewritten in the form 
x? 4 y* —6x —8y+21=0. 
(d) This circle has equation 
(x — 3)? + (y- 4) = 3°, 
which can be rewritten in the form 


x? + y* —6x —8y+16=0. 


. Since the origin lies on the circle, its coordinates 


(0,0) must satisfy the equation of the circle. 
Thus 


07 +0°?+ f-0+g¢-0+h=0, 


which reduces to the condition that h = 0. 


3. (a) We can use the general formula of Theorem 


2 for centre and radius, with f = —2,g = 
—6 and h = 1. It follows that the circle has 
centre (1,3) and radius V9 = 3. 

Here the coefficients of x? and yr are both 3, 
so we divide the equation by 3 to get it into 
standard form (that is, with the coefficients 
of x? and y? both 1); the equation becomes 


(b 


wm 


x? + y? —4x — 1l6y =0. 


We can use the general formula of Theorem 
2 for centre and radius, with f = —4,g = 
—16 and h = 0. It follows that the circle has 
centre (2, 8) and radius V68 = 2/17. 


4. (a) If we complete the square in the equation 


er+ytxetyt+1=0, (1) 


we obtain the equation 


iy i I 
P82) 2) =n 

This equation represents the empty set, 
since its left-hand side is always non- 
negative whereas its right-hand side is neg- 
ative. 

(Alternatively, we can apply the Remark 
following Theorem 2 of Subsection 1.1.2 to 
the quantity if? + 58° —h. In equation (1), 
we have f = 1,g = | andh = 1. Thus 


[Pt fe—nahe}n1<o 


so that the set must be the empty set.) 
(b) If we complete the square in the equation 


x7 +y?—-2x+4y4+5=0, (2) 
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(c 


wm 


we obtain the equation 

(x — 1)? -1+(y+2)?-44+5=0, 
or 

(x — 1)? + (y +2)? =0. 

Thus the set in the plane represented by 
equation (2) is the single point (1, —2). 

(Alternatively, we can apply the Remark 
following Theorem 2 of Subsection 1.1.2 to 
the quantity i f+ 48° —h. In equation (2), 
we have f = —2,g =4andh =5. Thus 


if? + fe? -h=1+4-5=0, 
so that the set must be a single point 
(-2F,-48) = ,-2)) 
Here the coefficients of x” and y” are both 


2; so we divide the equation by 2 to get it 
into standard form as 


(3) 


Then, if we complete the square in equation 
(3) we obtain the equation 


2 2 
1 1 3 9 5 
(« a3 i) i6 (v 3) 16 ~ 3 = 9, 


or 5 4 


Thus the set in the plane represented by 


x+y? +5x-3y-3 =0. 


equation (3) is a circle with centre (-4. 3) 


and radius ne = 2/2. 

(Alternatively, we can apply Theorem 2 
to the quantity ; fr 48° — h. In equation 
(3), we have f = 598 = -3 and h = -3. 


Thus 
1 42 1 2 1 9 5 25 
if +38 —h ig tig ta an 


so that the set is the_circle with centre 
(-1. 3) and radius ,/ = — 3/2.) 


5. Here we use the result of Theorem 3. 


(a) The condition that the intersecting circles 


Ci :x? + y?—4x —4y+7=0 
Co :x7 + y* + 2x -—8y+5=0 


(b 


in 


in 
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are orthogonal is that 

(—4)2 + (—4)(-8) = 2(7 + 5); 
that is, that 24 = 24. This is the case; hence 
C, and C2 are orthogonal. 
Here we must divide the given equation 
of C2 by 3 in order to get it in the form 
x? 4+ y*+--- = 0, before using Theorem 3. 
The condition that the intersecting circles 


Crix+y?+3x—-6y+5=0 


wm 


Coix?t+y?+Fxt+4y—-5=0 
are orthogonal is that 

(3)(4) + -0)(4) =26 - 5); 
that is, that 4 — 2 = O. This is not the case; 
hence, C; and C2 are not orthogonal. 


. First we write the equation of the second circle 


the (normalized) form 

x+y? +3x-3y+3=0, 
which the cofficients of x? and y? are 1. 
Then, by Theorem 4 (with k = —1) it follows 


that the equation of the line through the points 
of intersection of the two given circles is 


or 


x? 4+ y?—3x+4y-1 


—1(x? +y?+ $x —3y +3) =0, 


—Fx+7Ty—3=0; 


we can write this in the simpler form 


. (a 


(b 


lx — 14y +5=0. 


) The parabola E is the parabola in standard 
form where 4a = 1, ora = i: It follows 
I 

(1 
is (0,0), the axis is the x-axis, and the 
equation of the directrix is x = — 7 

) The coordinates of P and Q are (t?, t1) and 
(13,12), respectively. So, if t? # 5, the 
slope of PQ is given by 


that the focus of E is 0), the vertex 


t) —t2 


m= 
av) 
tty 


1 
t+t 
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Since (i 1) lies on the line PQ, it follows 8. 


that the equation of PQ is 


n= agli) 
= x a 
eam t+ t2 : 
Multiplying both sides by ft; + f2, we get 

(atm) —h)=x-4, 
so that 

(1 +h)y —t{ —th =x- 4, 
or 
(1 +h)y=x+thh. (4) 


If, however, i = oe then since 1 4 h 
we have tf) = —f. Thus PQ is parallel to 
the y-axis, and so has equation x = rE so 
in this case too, PQ has equation (4). 

The midpoint of PQ is the point 


(3(# 4 #2) (4+ A) 
This is the focus ( },0) if 
(3(#? +12) .3@1 +0)) = (4.0). 
Comparing the second coordinates, we 


deduce that fg = —t,. Comparing the first 
coordinates, we deduce that 


1 (,2 2) _ 1 
3 (+4) =4 


(c 


wm 


so that if = i It follows that t; = +} and 9. 


so that f = +5, respectively. 


When t = 5 the point (id) — (i 3): 
-5, the point (17,1) = 
-}). It follows that the points P and 


and when t = 


I 
4? 


Q must be (: 3) and (4,-3). 


Remark 


It follows that the chord PQ must be the vertical 
chord through the focus. 
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yA 


P (acost, b sin t) 


av 


Let P have coordinates (a cost, b sint). Since 
the coordinates of F are (ae, 0), the coordinates 
of M, the midpoint of FP, are 


(F(acost + ae), $ (asint +0). 


Thus M lies on the curve in R? with parametric 
equations 


r= 5(a cost+ae), y= 5b sint. 
We can rearrange these equations in the form 


2x — ae : y 
cost = , sin : 
a b 


squaring and adding these, we get 


(A=) (2) 
ey ae) =a 
a b 


We can rearrange this equation in the form 


1 2 
(« = sae) y? 


(aj eS 


thus M lies on an ellipse with centre (40, 0), 
the point midway between the origin and F. 


YA 


YY 7 -2y?=1 


(a) This hyperbola is of the form = x =1 


a2 


with a = 1 and b? = 4, so that b = 1//2. 
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(b 


(c 


Ye 


wm 


If e denotes the eccentricity of the hyperbola 
E, so that b? = a? (e? = 1), we have 


5 =e—1; 
it follows that e2 = 3 and soe = nea 


In general the foci are (ae, 0); it follows 


that here the foci are (= 3, 0). 


Let F and F’ be ( 5.0) and (-/3.0). 


respectively. (It does not really matter which 
way round these are chosen.) 
Then the slope of FP is 


ar 


where we may assume that sect # Fo 
since FP is not parallel to the y-axis; and 
the slope of F’P is 


1 

vq tant — 0 tant 

sect + /3 VJ/2sect + V3 
where we may assume that sect A — 3, 


since F’ P is not parallel to the y-axis. 
When FP is perpendicular to F’ P, we have 


that 
tan t tan ¢ 


J2sect—/3 /2sect +3 ~ 


We may rewrite this in the form 


tan’ f 
2sec2t—3 
so that 2sec?t — 3 + tan? 
sec? t = 1 + tan? 1, it follows that we must 
have 3 tan? t = 1. Since we are looking for 
a point P in the first quadrant, we choose 
tant = 1//3. 

When tant = 1/V3, we have sec?t = 
1+ ; :. Since we are looking for a 
point P in the first quadrant, we choose 
sect = 2//3. 


It follows that the required point P has 


a4, 


0; since 


coordinates 
(Fede a) = ee): 
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Section 1.2 


1. 


We use the formula of Theorem | to find the 

slope of the tangent to the curve at the point with 

parameter f, where ¢ is not a multiple of z. 
Since 


x(t) =2cost+cos2t+1 and 
y(t) = 2sin¢ + sin2r, 

we have 
x(t) = —2sint —2sin2t and 


y(t) = 2cost +2cos2t. 


Hence the slope of the curve at this point is 


y(t) _ 2cost + 2cos2t 
x(t) —2sint — 2sin2r 


cos ft + cos 2t 
~ sing + sin 21” 
In particular, at the point with parameter 
t = 7/3, this slope is 


cos 1/3 + cos2m/3 _ 5-5 


= =0 
sin 2/3 + sin 27/3 J/3/2 + V3/2 


Thus the tangent at this point is horizontal. Also, 


y(/3) = 2 sin(/3) + sin(27/3) 
v3 3 
= <v3. 
2 si 2 2 v3 
It follows that the equation of the tangent at the 
point with parameter ¢ = 27/3 is y = 3/3. 


Next, at the point with parameter ft 
the slope of the curve is 


m/2, 


0-1 


cosm/2+cosm _ _ 
1+0 


ile 


sinz/2+sinn 
Also, 
x(/2) = 2cos(z/2)+cos(z7)+1 = —14+1=0 
and 
y(a/2) = 2sin(/2) + sin(7) = 24+0=2. 


It follows that the equation of the tangent at 
the point with parameter f = 7/2 is 


y—2=1@%-0), ory=x+2. 
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2. 


(a) Here x/(t) = 1 and y’(t) = —1/t?; it fol- 
lows that the slope of the tangent at the point 
with parameter f is 


y@) — -1/t? 
x(t) 1 
1 
=-5. 


It follows that the equation of the tangent at 
the point P is 


or 


i) 


x 
The line with equation y = = + 2 passes 
through the point (1, —1) if 
—-l|= ate + 2 
aa a 
We can rewrite this equation in the form 


7 +2t-1=0, 


(b 


we 


or 
+1? =2; 

it follows that the values of ¢ at the two 

points on the hyperbola for which the tan- 

gents pass through (1, —1) are 


t=-1+¥V2. 
When t = —1 + V2, the equation of the 
tangent is 
x ny 2 
y = 
(v2 _ 1)" J/2-1 
x 2 

oe : 

3-2/2 2-1 
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When t = —1 — »/2,, the equation of the 


tangent is 
x 4 2 
y= 5) 
(-1- v2) -1-/2 
_ x 2 
3427/2 14/2 


3. The rectangular hyperbola xy = 1 has paramet- 


ric equations x = t,y = 1/t (where t ¥ 0). 
You found in Problem 2(a) that the slope of the 
tangent at the point with parameter f is 


ya oL 
x(t) tf 


Since -4 == at the slope of the tangent at 


the point (x1, y;) may be written in a convenient 
form as = (The slope may be expressed in 
many other forms involving x; and yj, but this 
particular form saves some algebra later in the 
calculation.) 

Then the equation of the tangent to the hyper- 


bola xy = | at the point (x1, y1) is 
1 
yr-n= ag — x1). 
xX} 


Multiplying both sides by x; , we may express 
this as 


x1 —X1yL = —xY1 + X11, 
so that 
xy t+ xy = 2x1y1 
=2; 


dividing this by 2, we obtain the required 
equation. 


. The required equations may be obtained by sim- 


ply substituting numbers into the appropriate 
equation in Theorem 2 or Problem 3. 


(a) The equation of the tangent to the unit circle 
x+y? = Lat (-},403) is 


x(-4) + v($v3) =i, 
which may be written in the form 


V3By =x +2. 
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(b) The equation of the tangent to the rectangu- 
lar hyperbola xy = | at (-4, -4) is 


Ho(-8) 9) =1 
which may be written in the form 
x+ loy = —8. 


(c) The equation of the tangent to the parabola 
y* = x at (1,—1) is 


yD = 304+ D, 
which may be written in the form 
x+2y=-1. 


5. Since 2? + 3? > 1, the point (2,3) lies outside 
the unit circle. Hence, by Theorem 3, the polar 
of the point (2,3) with respect to the unit circle 
has the equation 


2x+3y=1. 


(a) We saw in Example 2(a) that the slope of 
the tangent at the point P with parameter 
t (where tf 4 0) is 1/t. Since the normal 
and the tangent at P are perpendicular to 
each other, it follows that the slope m of 
the normal at P must satisfy the equation 
m- (1/t) = —1. Hence m = -t. 

The normal at P is thus the line through 
the point (ic 2t) with slope —f, and so has 
equation 


(b 


wm 


or 
y= —txt+2tte. (1) 
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(c) Let Q be the point on the parabola with 
parameter 7, say; thus its coordinates are 
(77,27): Since Q lies on the line with 
equation (1), it follows that 


2T = -1T* +2t4+2°; 
we can rearrange this equation in the form 
UT —th= -1(7° = ?) : 


Since T  t, we may divide through by 
T —f, to get 


2=-t(T +t) 


=-tT —??, 


directrix 


(a) P; has coordinates (at? 2at,) and P has 
coordinates (atz, 2at2). So, if i x ic the 
slope of the chord P; P2 is 


2aty — 2at, ty —t 

2 2 2 2 

aty — aty ty — ty 
2 

tatty 


It follows that the equation of P| P2 is 


2 2 
y — 2at) = (« - at;) 
y+ 


or 
(4 + m)(y — 2at)) =2(x-at?). @) 
If, however, iD — ie we must have 
t; = —f) since P; and P) are distinct. The 


chord P; P is then parallel to the y-axis, so 
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(b 


(c 


(d 


wm 


wa 


wm 


that we can write its equation as x = at?. 
Thus the equation of the chord is given by 
equation (2) in this case too. 

If the chord P; Po passes through the focus 
(a, 0), the coordinates of (a, 0) must satisfy 
equation (2); hence 


(4 +)(—2an)) = 2(a —at?), 
so that 
—t? — tt = 1-22. 


It follows that fjf2 = —1. 
It follows from Example 2(a) that the equa- 
tions of the tangents at P; and P> are 


ty =x +at? 

and 
_ 2 
ty =x + aly, 


respectively. 

Now it follows also from part (a) of 
Example 2 that the slopes of the tangents at 
P, and P> are 1/t, and 1/t, respectively. 
These tangents are perpendicular if 


@)-()=- 


and we can rewrite this condition in the 


form ft,f2 = —1. 
We have already seen in part (b) that 
t}t2 = —1, and so we deduce that the tan- 


gents at P; and P2 are indeed perpendicular. 
The equations of the tangents at P; and P 
are 


ty =x-+at; and f2y =x-+at;, 


respectively. By subtracting these, we see 
that at the point (x, y) of intersection, 


(t) —h)y = a(t - #3) : 
so that 
y =a(t, +12). 


It then follows from the equation tyy = 
x +t; that 


tya(ty +t) =x+ aty, 
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so that 
xX = atito 
=-a (since tif, = —1). 


The point of intersection is therefore 
(—a, a(t) + 12)). 

Since the first coordinate of the point of 

intersection is —a, it follows that the point 
of intersection lies on the directrix of the 
parabola. 
Since (by the result of Example 2(a)) the 
tangent at Q has slope 1/t, when t 4 0, 
it follows that in this case the normal at Q 
has slope —t. When t = 0, the point Q is 
the origin, the vertex of the parabola; so in 
this case too the slope of the normal is —r. 

Hence in general the equation of the nor- 
mal at Q is 


y —2at = —t(x —at?). (3) 


(e 


wm 


If this normal passes through F(a,0), 
then the coordinates of F must satisfy equa- 
tion (3); that is, 

—2at = —t(a— at’). 

We can divide through by a and then 

rearrange the terms in this equation to get 
O=r(1+4+?°). 

It follows that t = 0, and so Q must be the 

vertex of the parabola. 


: : 
E 
Let a1, 02,03, 04, B1, B2, 83, Ba be the angles 


indicated in the above diagram. 
Then 


a} =a2 (vertically opposite angles) 


= a3 (by the Reflection Property 
for the ellipse) 


=a4 (vertically opposite angles), 
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and 
Bi = Bo (vertically opposite angles) 


= £3 (by the Reflection Property 
for the hyperbola) 


= B4 (vertically opposite angles). 


Since a] +02 +03+04+ Bi + Bo+ B3+ Ba = 
27, it follows that 


aj + Bj = 510 for any i and any /. 

In particular, v3 + 63 = 57, so that the tan- 
gents to E and H are perpendicular. In other 
words, EF and # intersect at right angles. 

9. Let the point P(acost, bsint) lie on the ellipse 
in standard form with equation oo y =la> 
b > O, and let the perpendicular from the focus 
F(ae,0) to the tangent at P meet that tangent 


at T. 
yA 
P 
an T 


> 
F (ae, 0) x 


By Theorem 2 of Subsection 1.2.1, the tan- 
gent at P has equation 


x-acost  y-bsint 
a? be” 
which we may rewrite in the form 


bx cost + aysint = ab. ©) 


From this we see that, if t ¢ {—/2,0, 7/2, 7}, 
the slope of the tangent PT is —(b/a) cott, so 
that the slope of the perpendicular FT must 
be (a/b) tant. Since FT also passes through 
F (ae, 0), FT has equation 


a a 
y—-—tant-x = ——tant-ae 
b b 


ae 


= —— tant, 
b 


which we may rewrite in the form 


2 


ax sint — bycost = a“esint. (+) 
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Then the coordinates of the point T(x, y) of 
intersection of PT and FT must satisfy both 
equations (*) and (**). So, squaring each of (*) 
and (**) and adding, we find that the coordinates 
of T must satisfy the equation 


(x? + y?) (Ga cos’ t + a sin’ ) = 
a’ (° + a’e? sin? t) : 
We then rewrite this equation in the form 
2 b? + a2e? sin? t 
b2 cos? t + a? sin’ t 


(a? - a’e’) + ae? sin? t 


x+y? =a 


2 
=a 
(a? — ae?\(1— sin? t) +a? sin? t 
2 1—e?+e*sin’t 
1— sin? t — e2 + e2 sin? t + sin? t 
=a’. 


It follows that the point 7 must lie on the 
2 as required. 

If t = 0 or z, the tangent to the ellipse at P 
is a vertical line perpendicular to FP; so the tan- 
gent at P meets the perpendicular to it from F 
at P — which lies on the auxiliary circle. 

Finally, if ¢ = +7 } 2, the tangent to the 
ellipse at P is a horizontal line with equation 
y = +b. The point T where PT is perpendicular 
to FT must thus satisfy the equations x = ae and 
y = +5; this lies on the auxiliary circle, since 


auxiliary circle x? + y? =a 


xo y? =ae+ (+b)? 


=a’e* + (i - e”) =a’. 


Section 1.3 


1. (a) The equation of the conic 


11x? + 4xy + 14y? — 4x — 28y — 16 =0 


can be written in matrix form x? Ax+ J’x+ 
H =0, where 


ii 2 =f 
a=(J ae (3). 
H=-16 and x= (*). 

y 
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(b) The equation of the conic 
x? —4xy + 4y? — 6x —8y+5=0 


can be written in matrix form x? Ax+ J’ x+ 
H = 0, where 


1 -—2 —6 
a=( a) I= (Gs): 
H=5 and 1). 

y 


2. (a) We saw in Problem 1 (a) that the matrix form 
of the equation of this conic is x? Ax+- J’ x+ 
H = 0, where 


11 2 —4 
a=(", o il) 
H=-16 and x=(*); 

y 


that 
11 2 x x 
oo al) +2) - 
16=0. 
First we diagonalize A. Its characteristic 
equation is 


O=det(A-AD=/ 5° 14, 


es a | 
= 17 — 250+ 150 
= (A—15)(A — 10), 


so that the eigenvalues of A are A = 15 and 
i = 10. The eigenvector equations of A are 


(1—A)x +2y =0, 
2x + (14—A)y =0. 
When A = 15, these equations become 
—4x+2y = 0, 
2x —y=0, 
so that we may take as a corresponding 


. 1 : , 
eigenvector >) which we normalize to 


have unit length as eval 
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When 4 = 10, the eigenvector equations 
of A become 


x+2y=0, 
2x +4y = 0, 
so that we may take as a corresponding 


‘ 2 . ; 
eigenvector \ which we normalize to 


2//5 
2) 
WV5 2/V5|_ 1 4g I 
o/s. =1j73| 2 CO 
so interchanging the order of the eigen- 
vectors as columns of P — in order that 
det P = +1, so that then P represents a rota- 
tion rather than a rotation composed with a 
reflection — we take as our rotation of the 
plane the transformation x = Px’, where 
— ( 2//5 1 KA 

ART /5 2/75)" 

This transformation changes the equation 
of the conic to the form 


(Px’)? A(Px’) + J’ (Px’) + H =0 


have unit length as ( 


Now 


or 


(x’)? (PT AP)x’ + (J’P)x’ + H =0. 


Since P'AP = (5 ia this is the 
equation 
, flO OY (x’ 
# (Y i5)G) 
ee j 
1429) |S .(%,)-16 =0. 
V5 4/5 


We may rewrite this equation in the form 
10x”? + 15y +.4y/5x’ — 12V/5y’ — 16 = 0, 
or 

12 Dat mM 4Avr\_ _¢-. 
10(x + ex )+15(y 4y') 16 = 0; 


so that, on completing the square, we have 


10(x'+ +) -2415 (»’- 2)" 
19-160 
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or 


10(x"+ 4)’ +15(y'- 3) -30=0 


or 


2 2 
(+2) UW, 


This is the equation of an ellipse. 
From equation (2) it follows that the cen- 


tre of the ellipse is the point where x’ + 
t —. Po Be 5 

We = ) = y Fi 7 0, that is, where 
(tee pe 2 7 

eae and y’ = J From the equa 


tion x = Py’, it follows that in terms of the 
original coordinate system this is the point 


2 1 1 
x\ J5 J5 aS 
a 1 2 2 
V5 V5 V5 


-(). 


that is, the point (0, 1). 

Since 3 > 2, it also follows from equa- 
tion (2) that the major axis of the ellipse has 
equation y’ — a = 0,or y’ = wat and the 

: ‘ : ! a 
minor — has equation x’ + a= 0, or 

Ps a 
elena 

Finally, since the matrix P is orthogonal 
we can rewrite the equation x = Px’ in the 


form x’ = P~!x = Px, so that 


or as a pair of equations 


ae ene D 
St = ey 
y= Jax t+ Sey. 


It follows that the equation, y’ = ae 
of the major axis of the ellipse can be 
expressed in terms of the original coordinate 


system as 


rs x+2y =2. 


Bt R=R 
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Similarly, the equation, x’ = — ra of the 
minor axis of the ellipse can be expressed in 


terms of the original coordinate system as 


Be R= ye 8 2x—-—y=-l. 


iy 
ye 


(b) We saw in Problem 1(b) that the matrix form 
of the equation of this conic is x7 Ax + 
J’x + H =0, where 


1 -—2 —6 
H=5 and t="): 
y 


that is 
1 —2\(x 
@ (4 V)G)+ 


(-6 -#)(*)+5=0 
y 


First we diagonalize A. Its characteristic 
equation is 


t= — 

0 = de — 20 =| - at 
= j* = 5), 
=A(A — 5), 


so that the eigenvalues of A are A = 0 and 
A = 5. The eigenvector equations of A are 


(1 —A)x —2y =0, 
—2x+(4-A)y =0. 
When A = 0, these equations become 
x—2y=0, 
—2x+4y =0, 
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so that we may take as a corresponding 


; 2 : : 
eigenvector i}? which we normalize to 


2 
have unit length as ( val 
When A = 5, the eigenvector equations 
of A become 


—4x —2y = 0, 
—2x —y=0, 
so that we may take as a corresponding 


: 1 : ' 
eigenvector 7) which we normalize to 


. 1/J/5 
have unit length as & val 
25 1jp/5 
iJ =2//5 
so interchanging the order of the eigen- 
vectors as columns of P — in order that 
det P = +1, so that then P represents a rota- 
tion rather than a rotation composed with a 
reflection — we take as our rotation of the 
plane the transformation x = Px’, where 
— ( ff5 2/5 

~ \=2f/5 1//5 
tion changes the equation of the conic to the 
form 


(Px’)’ A(Px’) + J’ (Px’) + H =0 


—__4 L 
~ 5 a I, 


Now 


) . This transforma- 


or 


(x)? (PT AP)x’ + (J’P)x’ + H =0. 


Since P’ AP = (; ) this is the equation 


NA 
+(—6 —8) . : ({))+5=0. 


V5 5 
which we can rewrite in the form 
5x? + 2/5x' — 4/5y' +5 =0. 
We may rewrite this equation in the form 


5 (e i 4,x') ~4/5y'+5=0 
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so that, on completing the square, we have 
2 
/ 1 = / = 
5(x + +) 4/5y'+4=0, 
or 


es sf =), = 4v/5(y' 2 +z) i 


or 


die ey 4(y_ 1 
(+33) =4('-z). © 
This is the equation of a parabola. (It is 
not quite in standard form (y’”)? 
but in the similar form (x”)? = 4ay”; the 
argument will be similar.) 
From equation (3) it follows that the ver- 


= 4ax”, 


tex of the parabola is the point where x’ + 


1 / 1 : 
> 0 = ye ~ 0, that is, where 
/ = = 
c= = 5 and y = Fe From the equa 


tion x = Px’, it follows that in terms of the 
original coordinate system this is the point 


NR 
i 
= 


(*)- v5 V5\[ v5 
y, 


ll 
geen 


that is, the point (3 3). 

It also follows from equation (3) that the 
axis of the parabola has equation x’ + x = 
0, or x’ = —-L. 

Then, since the matrix P is orthogonal we 
can rewrite the equation x = Px’ in the form 
x’ = P~!x = Px, so that 


; ae 
G)-(3 FO) 
ry | 2 1 
y Ve. Ss y 
or as a pair of equations 
ae eee 
fo et 


2 1 
y= vs * se 
It follows that the equation, x’ = ak. 
of the axis of the parabola can be expressed 
in terms of the original coordinate system as 


1 2 _ 1 
Go eS 


o x=—2yS =I. 
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3. Here we use Theorem 3. 


(a) The matrix of the non-degenerate conic is 


3 —4 
A= & so that 


det A = 6 — 16= —10 <0, 


so that the conic is a hyperbola. 
(b) The matrix of the non-degenerate conic is 


4 
ol so that 
detA = 16— 16=0, 


so that the conic is a parabola. 
(c) The matrix of the non-degenerate conic is 


52 —36 
A= & 73) 80 that 
det A = 52-73 — 367 = 3796 — 1296 
= 2500 > 0, 
so that the conic is an ellipse. 
Section 1.4 
1. The equation of the quadric surface given by 
Ox? <b 5y? =? 4 xy —3yz 
—2xz—2x —6y+ 10z —-12=0 


may be written in matrix form x? Ax + J’x + 
M = 0 where 


2 4 -1 =) 
A=| 5, 5 -3], J=|-6], 
=] -3 —] 10 
XxX 
x= ly and M=-—12 


Appendix 3: Solutions to the Problems 


Similarly, the equation of the quadric surface 
given by 
YO YEH XZ 
may be rewritten as 
yztxz—y=0, 


and so in matrix form x7 Ax + J’x + M =0 
where 


1 
oa 0 
A=]0 0 5], J=[-1 ]. 
1 1 
7 2 0 " 
Xx 
x=|y and M=0 


2. We saw in Problem | that the equation of this 
quadric E can be written in matrix form as 
x Ax + J’x + M = 0, where 


0 0 4 
A=]0 0 5], J=[-1 ]. 
1 1 
5 7 O 
Xx 
x= |y and M=0O, 
z 
that is, as 
00 3\/x 
(x y z)}0 0 5]ly 
LT 1 
2 3 
Xx 
+(0 -1 O0)f y} =0. 
& 


We start by diagonalizing the matrix A. Its 
characteristic equation is 


-- 0 4 
O=det(A-AD=/0 -a } 
$4 =, 
= (8-1) +4(0) 
=P + 5a 
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so that the eigenvalues of A are A = 0, a and 


= wo The eigenvector equations of A are 
—Ax +32 =0, 
Ay +42 =0, 
5x + sy —Az=0. 


When A = 0, these equations become 


1 1 
gy 


From the first two equations we get z = 0; it 
then follows from the third equation that x + 
y =0. So we may take as a corresponding eigen- 


1 
vector | —l1 ]}, which we normalize to have 
0 
ok 
/2 
: -1 
unit length as a 
0 
Similarly, when 4 = - qr We may take as a 
1 
corresponding eigenvector 1}, which we 
V2 
1 
2 
normalize to have unit length as 5 ; and 
ole 
v2 
Se all 2 
when A = 75° We may take as a correspond 
1 
ing eigenvector 1 |, which we normalize 
—/2 
1/2 
to have unit length as 1/2 
—1//2 
Now 
as 1 i 1 1 1 
v2 2 7 Yi 2 2 
1 1 1 
1 1} |0 + -+ 
0 4K v2 2 
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Since interchanging the first and second 
eigenvectors interchanges the first two columns 
of the determinant, in order to have det P= 1 
we choose to take as a convenient rota- 
tion of R? the transformation x= Px’, where 

ij72 v2 1/2 
P=| 1/2 -1//2 1/2 |. This trans- 
v2 0 | -1/72 
formation changes the equation of the quadric 
to the form 


(Px’)? A(Px’) + J’ (Px’) + M =0 


or 
(x')"(PTAP)x’ + (J7P)x’ + M = 0. 
1 
fe 0 O 
Then since P’AP = 0 O O |, the 
1 
0 0 a 


equation of the quadric becomes 


+0 0 \/x 
(x y zy} 0 0 0 y’ 


x! 

y | =9, 
1 1 Zz 
A 9 4 


which we can rewrite in the form 


NI NI 
Sir 

ns 

NI Ne 


+(0 —1 0) 


q(x’)? aly? 5x + ay 


or 


1\2 1\2 | eer / 1 ao 
(x') (z) Jat +P Fe = 90. 
Completing the square in this equation, we 
get 


/ 1 2 1 ‘0 
(2+ a) rere 
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We now make the transformation 
mre 
2/2 
x’ = x’ Ae 0 ; ( 1) 
1 
2/2 
so that we can rewrite the equation of F in the 
form 


(x’’)? _ Cb + y" —_ 0. (2) 
It follows from equation (2) that E must be a 
hyperbolic paraboloid. 


From equation (2) it follows that the centre of 
E is the point where x” = 0, y” = Oandz” = 0; 
that is, the point where x’ = 1/(2V2),y’ = 
0 and z) = —1/(2V2). From the equation 
x = Py’, it follows that in terms of the original 
coordinate system this is the point 


, 4: @ 1 
x a 2 - 2/2 
1 1 1 
v=o =A 2 0 
Z 1 9g 1] \- 
Wh Wf 22 
0 
={0], 
1 
2 


that is, the point (0. 0, 1). 


3. The line m through the points (2, 0,1) and 
(0, —/2, —1) can be parametrized as 


po/2,0,D +0 =n), =V2, =1) 
= (uV2, (uw — 1I)V2, 24 — 1), 


where yz € R. Points on the line lie in E since 


(uv 2)? + ((u = 172)? = Qu = 1)? 
= 2yu?42(u?—2w+1)—(4u?—4u+1) = 1. 
4. (a) Let the point (xo, yo, zo) € A, so that x9 + 
yo = 0 and zp = 0. Then 
x9 — Yo + 20 = (Xo + yo)(X0 — yo) 
+20 = 0, 


so that (xo, yo, Zo) lies in E. 
Similarly, the point (xo, yo, Zo) € B lies 
in E. 
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(b) The point (A—j,A+p, 442) lies in E, since 
(A w)? — (A+B)? + Grp) 


= (2? —2Au+ u) 


— (47 + 2am + m7) + 4au = 0. 
(c) Let A be a fixed number. Then 
€, ={A-pw,A+ 4a): uw € R} 
= {(A,A,0) + w(—1, 1,44): w € R}, 
so that £, is a line through the point (A, A, 0) 
parallel to the vector (—1, 1, 42). Clearly the 
point (A, A, 0) lies on B. 

This line passes through the point (A — 
1,A+1,4A) since it is the point on 2, where 
Vega le 

When A = 0, 

fo = {(-u, 1,0): w € R} 
= {u(-1,1,0): ne R}; 

so that this line is the line through the ori- 
gin parallel to the vector (—1,1,0). Thus, 
lo =A. 
Let yz be a fixed number. Then 
My ={(A—p,A+ pw, 4A): A € R} 

= {(—, #0) +A(1, 1,44) 24 € R}, 
so that m, is a line through the point 
(—p, 4,0) parallel to the vector (1, 1, 4,2). 
Clearly the point (—Z, 4, 0) lies on A. 

This line passes through the point (1 — 
,1 + ,4) since it is the point on m, 
where A = 1. 

When pu = 0, 

mo = {(A,A,0) : A € R} 
= {AC,1,0):A€ R}, 

so that this line is the line through the origin 
parallel to the vector (1, 1,0). Thus mo = B. 
Consider / first. Two distinct lines in # 
may be expressed in parametric form as 

{(A1 — W,A1 +, 4A14) : w € R} 
and 

{Q2-— mao +u',4aou’): uw’ ER}, 


for some real numbers ju and py’. 


(d 


wa 


(e 


wm 
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These lines meet where 
A bM=)ho- we, 
Ay + b= Ag+ is 


and 
4Aj = 4p’. 


Adding the first two equations gives 
2X1 = 2A2, so that A; = Ao. Thus the two 
lines can only meet if 4; = Az, so that all 
lines in the family -# are disjoint. 

A similar argument shows that all lines in 
the family .# are disjoint. 

Finally, if a line in & intersects a line 
in .@ they can have at most one point in 
common since the lines are non-parallel — 
for, one is parallel to (—1,1,0) and one is 
parallel to (1, 1,0). 

So, let the line 


ly, = {Ar — war + 1, 4Arp1) : 1 
varies over R} (3) 


in Y and the line 


Myy = {(A2 — 2,42 + 2, 4A2[2) : A2 
varies over R} (4) 


in @ meet at a point (a, b,c). 
For this to happen we must have 
(Ay — Mi, Ad + M1, 4011) 
for some real numbers 1, 41,42 and [2. 
We deduce from this equation that 
Ay — 1 = 42 — be, 
Ay + pi = A24+ po, (5) 
4011 = 4A2K2. 
By adding the first two equations in (5) we 
find that 2A; = 2A, so that A; = Ao; and 
by subtracting the first two equations in (5) 
we find that 241 = 2j2, so that wy = 2. 
For simplicity, we now assign the values 
A and yz to the common values of A;, Az and 
L1, 42, respectively. 
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It follows that the line in Y 
€,={A— wm A+ UW, 4Ap) : 


is a specific real number} 

and the line in .@ 
My ={A— p,rA+ pw, 4Ap) 2A 
is a specific real number} 


meet at one point (a, b,c) so long as (a, b,c) 
is that common value 


(Aq — €1,A1 + 1, 40141) 
= (A2 — b2, 2 + M2, 4212) 
= (A-—p,A+ pW, 4Ay). 


This happens when 
a=A— hp, 
b=)+ hp, 
c=4u; 

so that 


A = 3(a+b) and uw = 5(b —a). 


Chapter 2 
Section 2.1 


1. 


First, reflect the figure in the line through O, the 
centre of the circle, and P. Under this reflection, 
P remains fixed, and the circle maps onto itself. 
In particular, the point A maps to a point A’ on 
the circle, and so the tangent PA maps onto the 
line PA’. 


Now the tangent PA meets the circle at a sin- 
gle point A, so the image of the tangent must 
meet the circle at a single point. But the only 
way in which that can happen is if A’ coincides 
with B. 
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Hence the line segment PA is reflected onto 
the line segment PB. Since reflection preserves 
lengths, it follows that PA = PB. 

2. It is sufficient to show that there is an isometry 
which maps AABC onto ADEF. To construct 
this isometry, we start with the translation which 
maps A to D. This translation maps AABC 
onto ADB'C’, where B’ and C’ are the images 
of B and C, respectively. 


E. 
F 


Dx 
translat . <y : rotate 
a © ° x # . 


Since DC’ = AC = DF, we can now rotate 
the point C’ about D until it coincides with the 
point F. This rotation maps ADB’C’ onto the 
triangle ADB” F shown below, where B” is the 
image of B’ under the rotation. 


Now notice that 


ZEDF = ZBAC (given) 
= ZB'DC' (translation) 
= ZB" DF (rotation), 


so either B” lies on DE or the reflection of B” 


in the line DF lies on DE. Also 


ZDFE = ZACB (given) 
= ZDC'B' (translation) 
= ZDFB" (rotation), 


so either B” lies on FE or the reflection of B” in 
the line DF lies on FE. It follows that either B” 
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coincides with E or the reflection of B” in the 
line DF coincides with E. 

So, composing the translation, the rotation 
and (if necessary) a reflection, we obtain the 
required isometry which maps AABC onto 
ADEF. Since isometries preserve length and 
angle, it follows that BC = EF,AB = DE 
and ZABC = ZDEF. 


. Here we use the fact that a matrix U is orthogo- 


nal if U7U = I. We have 

cos@ sin@\ fcos@ —sin@ 1 0 
(: sin@ cos ») a cos 6 ) ~ € ') 
and 

cos@ sin@ cos@ sin@ 1 0 
ee bee ee eG '} ; 


So both matrices are orthogonal for all real 0. 


. First, t) 0 f2(x) is equal to 


A 
mA 
+ 


(Gad O) 


_—_— 
a 
aa 
| 
— 
_—_—— 
aa 
I 
oe 
n— 


VI ID UB WIL Gud Lal 
Mou YW Ul 
‘ “——_——” 
—~. 
— 
WI UIE iv tas 
ULE Ulu 


tal 
+ 


(G+) 


UIE UID Uo UI 


UL Ulu NN we 
eee 
—— 


Ps 
> ia 
| 
uo Un 
——— 
wn 
+ 
| 
NO = 
So 
“———_——— 
oa 
| 
re NY 
ae 


MW ae Uw Ul 
AE UW UR UW 


(1 o)**() 


5. (a) Since U is an orthogonal matrix, it follows 


that U-! = U?. Taking the transpose of 
both sides, we have 


0°)" =v) -v=(7y" 
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Thus U~! is an orthogonal matrix, and so f2 
is a Euclidean transformation. 


(b) We have 

t1 0f)(x) = t (u-'x - U-'a) 
=U(U"'x—U"la) +a 
=(x—a)+a 
=x 

and 

tz 0 ti (X) = h(Ux +a) 
=U! (Ux+a)—-U'a 
= (x+U-'a) —U'a 
=x, 


SO fz is the inverse of fy. 


6. We have 
3 _4\7! 3 4 
5 5 _ 5 5 
5 5 5 5 
and 
3. 4 
5 5 1 _ —1l 
4 3) \-2) 7 \-2)’ 
| 5 
so that 


3. 4 
ra =( 5 s)x+ (3) 
= 73 2 
> 35 


7. (a) Not congruent 
(b) Congruent 
(c) Not congruent 
(d) Congruent 
8. Suppose that we are given three plane figures 
F\, Fo and F3 such that 


F is congruent to F3 (1) 
and 

Fy is congruent to F3. (2) 
It follows from (2) and the symmetric property 
of congruence that 

F is congruent to Fp. (3) 
Hence from (1) and (3) and the transitive prop- 
erty of congruence, F; is congruent to F>, as 
required. 
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Section 2.2 


1. We use the fact that a 2 x 2 matrix is invertible 
if and only if its determinant is non-zero. 
Each transformation is of the form 


xh Ax+b, 


where A is a 2 x 2 matrix, and so it is an affine 
transformation if and only if the determinant of 
the matrix A is non-zero. 


(a) Here, 
1 3 
= 23 =>—1, 
I 
which is non-zero; hence ft, is an affine 
transformation. 
(b) Here, 
-—6 5 
= —-12-15= —27, 
Sa 
which is non-zero; hence fz is an affine 
transformation. 
(c) Here, 
—2 -l 
=-84+8=0; 
ar a 


hence f3 is not an affine transformation. 
(d) Here, b = 0 and 
5 -3 
= 10-6=4, 
22 


which is non-zero; hence f4 is an affine 
transformation. 


2. (a) Here, f; 0 f2(x) is equal to 


"(C3 a)x+() 
“(1 2)(G 2)=*() 
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3 Il 
0 9 


|=27-0=27 40, 


it follows that ft) 0 f2 is an affine transforma- 
tion. 
(b) Here, f2 o t4(x) is equal to 


a(( “a)a) 


Since 
—40 28 
| rT 2} = 200 ~ 308 = -108 +0, 


it follows that f2 0 ty is an affine transforma- 
tion. 
3. As described in Appendix 2, the inverse of 


a2 x 2 matrix A = ( 7) BAT = 
c d 


d -—b 
1 
Gabe ( a ) Hence 


=¢ 


and 
—2 3 4\ (14 
1 -1) \-2) — 6)’ 
so that 
ne ee 14 
r= jse(') 
Section 2.3 


1. Let (x, y) be an arbitrary point on the line 3x — 
y+1 =O, and let (x’, y’) be the image of (x, y) 
under ¢.Then 


()-G 2O+G). 


. First, we take b = (3). 
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Since the inverse of the inverse of any invertible 
transformation is the original transformation, it 
follows from Example | that under t—!, we have 


~()-E)EG) 


x=4x'+y'+2 and y=2x’+y—-1. 
Hence the image under ¢ of the line 3x — y + 
1 =0 has equation 


3(4x’ + y+ 2)—(2x’+ y’-1I+1=0. 
Dropping the dashes and simplifying, we obtain 
Sx+y+4=0. 


. The argument here is similar to that of Prob- 


lem 1. For, if (x, y) is an arbitrary point on the 
circle x? + y* = 1 and (x’, y’) is the image of 
(x, y) under f, then under t—! we have 


“O-E 9020) 


x=4x'+y'42 and y=2x’4+y'-1. 
Hence the image under t of the circle x7 + y? = 
1 has equation 


(4x' + y' +2)°+ 2x’ +y'-1°=1. 
Dropping the dashes and simplifying. we obtain 
10x? + 6xy + yy? +6x+y+2=0. 


3 
Next, we construct the matrix A whose first 


column is 

1 2 -1 

(3)-@)- (3) 

and whose second column is 

3 2 1\. 

(-1)- G)=(4) 
-1 1 

- ( 3 2) 

The required affine transformation ¢ is therefore 


t(x) = © 1) x+ (3) (x € R?). 


thus 
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: 1 
4, First, we take b = ( ) Next, we construct 


—2 
the matrix A whose first column is 


)-()=6) 


and whose second column is 


C)-)-): 
(9) 


The required affine transformation f is therefore 


t(x) = (; ) x+ (:) (x € R?). 


5. First, we find the affine transformation t; which 
maps (0,0), (1,0) and (0,1) to 1, —1), (2, —2) 
and (3,—4), respectively. This transformation 


has the form t; (x) = Ax + b, where b = ( i) 


thus 


and 

res 2-1 3-1 _ 1 2 
—2+1 -4+41 -1 -3 

that is, 


wo-(4 3e(4) 


Next, we find the affine transformation f 
which maps (0,0), (1,0) and (0,1) to (8, 13), 
(3,4) and (0,—1), respectively. This transfor- 
mation has the form f(x) = Ax + b, where 


that is, 


-5 -8 8 
(x) = (5 a+ (45): 


We now require the formula for the inverse 
transformation i . Since 


Ci 2) --G 2-0 3 
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and 


it follows that 


2 1 
oC} -() 


The required affine transformation f is therefore 
t(x) = ho ty (x), where f2 0 ty (x) is equal 


“(Ca )-@) 


Section 2.4 
1.(a) 


B 
: c 


By Ceva’s Theorem, we have 


AR BP CQ _ 


; AR _ 3 AQ _ 3 CQ _ 
First, RR 3: Next, oC =3 and so oA = 
2 


3 . 
It follows that 


Remark 
This means that P is the midpoint of BC. 
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(b) 2, 


ve) 
LX A(-1,1) 
7 


C 
r ss (a) Here we use the formula for calculating 
ratios given at the beginning of Appendix 2, 
By Ceva’s Theorem, we have just above the Section Formula. This gives 
AR BP C 
Se. BP xp—xgp %-2 
RB PC QA = = = 2, 
PC xc—xXxp = - 
Since a = 5 and a= = -%, it follows that 
; \ C CO. 49-26 2=3 1 
cn a, ee OA =m = 
2 7 QA ‘ 
AR - z+1 3 
so G2 — —7. — *R a eee = " 
RB XB—XR = 3 2 
Thus 
Remark a 
This means that Q lies on CA beyond A, and EI ES Tee ls 
the length of CQ is seven times the length of Q divides CA in the ratio | : 3, 
AQ. R divides AB in the ratio 3 : 2. 
(c) (b) It follows from part (a) that 
¥ AR BP CQ _ 3 t. 1 
RB PC QA 2 ae 
R 
of] Thus by the converse to Ceva’s Theorem, 
the lines AP, BQ and CR are concurrent. 
3. (a) 
A 
B P c 
By Ceva’s Theorem, we have - 
AR BP CQ | 1 Q ; 
RB PC QA ~ B \. ? 
: BP _5 C 
We are given that Fo = 7 and oA = —7, so By Menelaus’ Theorem, we have 
AR 5 AR BP C 
“* 2 (Neal. La 
RB 7 RB a QA 
Hence 4% = —3. We are given that 4 ox = 2 and _ = 2, 
so 
Remark 2. (—2). ee =-l. 


This means that R lies on BA beyond A, and the Oe 


C. 
length of AR is one fifth of the length of RB. Hence om =4 
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Remark (a) Here we have 
This means that Q lies on CA between C BP xp—xp 3 +2 
and A, and the length of CQ is one quarter PC tesa = jae =-3, 
of the length of QA. a 
CQ xg-Xc | gl, 
= = = z= 
QA xXa-xXQ 2-5 
(b) 6 AR XR-XA _ 12 _ 
RB xgp—xr  -2-1 3 
Thus 
4 P divides BC in the ratio — 3: 1, 
L 
Q divides CA in the ratio | : 1, 
B : P R divides AB in the ratio | : 3. 
By Menelaus’ Theorem, we have (b) It follows from part (a) that 
AR BP CQ 1 (-3)-1=-1 
ae EF ee 4 RB PC OA 3 = 
Be. Ee A Thus by the converse to Menelaus’ The- 
orem, the points P, QO and R are collinear. 
We are given that ah = i and — = —?2, r Q 
Ne) 
5. 
A 
1 Cc 
(-7):» S8- 1. R 
Q 
6% 
Hence ot = —2. B POG T 
By Ceva’s Theorem, we have 
AR BP CQ | (1) 
Remark RB PC OA ~ 
This means that Q lies on CA beyond A, and Also, by Menelaus’ Theorem, we have 
the length of CQ is twice the length of QA. AR BT CO ; ' 
RB TC QA © 
Comparing (1) and (2), we deduce that 
4. BT_ BPs, 
Tc F3C- = 
6. 
A D 
B (0) C 
By the Fundamental Theorem of Affine 


Geometry (Theorem |, Subsection 2.3.2), there 
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exists a unique affine transformation t which 

maps A, D and C to A‘(0,1), D‘(0,0) and 

C’(1, 0), respectively. 

(a) Since t maps AD onto the vertical line A’ D’, 
and BC is parallel to AD, it follows that the 
image of BC under ¢ must be a vertical line. 
Also, since t maps DC onto the horizontal 
line D'C’, and AB is parallel to DC, the 
image of AB under ¢ must be a horizontal 
line. It follows that B’, the image of B under 
t, must be the point with coordinates (1, 1). 

(b) Since P is the midpoint of AB, its image 
P'=t(P) must be the midpoint of A’B’ 
since ratios along a line are preserved by f. 
Hence P’ = (5. 1). 

Since the slope of D’P’ is 2 and the line 
passes through the origin, the equation of 
the line D’ P’ must be 


y =2%; (3) 


Next, since Q is the midpoint of BC, its 
image Q’ = t(Q) must be the midpoint of 
B’'C'. Hence Q! = (i, +). 

Then the slope of the line A’Q’ must be 

1-51 
0-1 2 

Since the line passes through the point 
A’(0, 1), the equation of the line A’Q’ must 
be 


1 
—l=-—=(x -0); 
y 5 ) 
that is, 
1 
=-- I; 4 
y get (4) 


Now, R’, the image of R under tf, must 
lie on the lines D’P’ and A’Q’, so that its 
coordinates must satisfy both (3) and (4). 
Substituting for y from (3) into (4), we 
obtain 


1 
2x =-= 1, 
x yr 


so that x = 2. It follows from (3) that 


y= z. Thus R’ is the point (3. 4). 
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Comparing the y-coordinates 1, - and 0 
of P’, R’ and D’, we obtain P’R’ : R’D! = 
1:4. 

Since ratios along a line are preserved by 
the affine transformation ¢~!, it follows that 


PR: RD=1:4. 


Finally, comparing the x-coordinates 0, g 
and 1 of A’, R’ and Q’, we obtain A’R’ : 
R'Q' = 2: 3. Since ratios along a line are 
preserved under the affine transformation 
t~!, it follows that 


AR: RQ =2:3. 


7. The matrix M for the triangle of reference 


AABC is 


whose inverse is 


It follows from the representation (7) that the 
point (—1,1) has barycentric coordinates with 
respect to the triangle of reference AABC 
given by 


G.-= 2 = 1 
=] =| -2 |; 
=| 4 y) 


namely, barycentric coordinates (1, —2, 2). 


. (a) By Theorem 6, the points with barycen- 


tric coordinates (1,1,—1), (4,-—-2,—1) and 
(3, 2,- 3) are collinear if and only if 


1 

i 4 4 
1-2 2/=0 

3 

-1 -1 -3 
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Now, 
i a 4 
2 _ 
{ =<% 2 -| Z * | 
af =f -3 =h 3 
1 2 cit =2 
_—4 1 
[4 Alaa | 
=G+2)-4(-34+2)+ 3-1-2) 


It follows that the points are not collinear. 
(b) By Theorem 6, the points with barycentric 

coordinates (1, 1, 

are collinear if and only if 


a | 
1 —2 7 |=0. 
=| f = 
Now, 
i 2 =i 
i =2 9 ear 
=| it =5 
i 4 1 = 
-2} 1) 7 |-| 2 | 
=(10=7) =2(5 +7) == 2) 
= 3—2(2)-(-1) 
=3-4+1 
=0. 


It follows that the points are collinear. 

By the Corollary to Theorem 6, the equa- 
tion of the line through the points (1, 1, —1) 
and (2, —2, 1) is 


1), 2, -2, 1), (-1,7, —5) 
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Now, 
1 2 2.4 
1 —2 nl= 1 
-1 1 ¢ $ 
1 1 —2 
tlre | 
= (—2¢ — 9) — 2 +) +§(1—2) 
It follows that the equation of the desired 
line is 
&E+3n+4¢ =0. 
Section 2.5 
1. 
A 
4 t 
Nt 
R 7 2 
R 
B 9 CB’ ; Cc 


P 
First, map the ellipse onto the unit circle, by 
some affine transformation rf. Since tangency is 
preserved by affine transformations, the image 
under f of the triangle AABC is another triangle 
AA'B'C’, whose sides are tangential to the unit 
circle. 

These sides touch the unit circle at P’ = 
t(P), QO’ = t(Q) and R’ = t(R). 

By Problem 1| of Section 2.1, the two tangents 
from a point to a circle are of equal length, and 
so (ignoring the directions of line segments) 


A’QO' — A’ R’ 
C’P’ — C'O’. 


B'P' = B'R’ and 


In terms of signed distances, it follows that 


C’ if 
: re = +1, 
in fact the product must equal | since P. Q and 
R are internal points of the sides of the triangle 
and therefore each of the above three fractions 
is positive. 


B'P’ 
P’C’ 


A’R' 
R'B! 
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Since ratios of lengths along a line are not 
changed by the inverse affine transformation 
t—!, we deduce that 

AR BP CQ 

aa oe OA = 
It follows from the converse to Ceva’s Theorem 
that the lines AP, BQ and CR are concurrent. 


A’ 


p 


B’ 


AN 
lea 
C4 


T' 
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under affine transformations, f maps the centre, 
O, of H to the centre of H’, namely the origin. 

Let m’ be the image of OP under t. Then m’ 
passes through the origin and the point (1, 1), 


so its equation is y = x. Clearly, H’ is sym- 
metric with respect to m’. Now let ¢’ be the line 
with equation y = —x; this is perpendicular to 


m'. By symmetry, m’ bisects all chords of the 
rectangular hyperbola which are parallel to ¢’. 

But the properties of parallelism and of ratios 
along a line are preserved by affine transforma- 
tions, so if £ is the line r—! (£’), then O P bisects 
all chords of H which are parallel to £. 


First, map the ellipse onto the unit circle, by Chapter 3 
Section 3.2 


1. (a) This represents the same Point as [1, 2, 3], 


some affine transformation rf. Since tangency is 
preserved by affine transformations, the images 


under ¢ of the tangents TA and 7B are tangents 
T'A’ and T’ B’ to the unit circle. 

Let P’ be the point of intersection of the 
chord A’B’ with the line joining T’ to O’, the 
centre of the unit circle. By symmetry, the tri- 
angles A T’A’P’ and A T'B’ P’ are Euclidean- 
congruent and so A’ P’ = B’ P’; in other words, 
P’ is the midpoint of A’B’. 

Let the line joining T to the centre O of the 
ellipse meet AB at P. Then, since P’ is the 
midpoint of A’B’ and since midpoints of line 


segments and centres of ellipses are preserved 
by the inverse transformation t!, P=tl(P) 
is the midpoint of AB = t~!(A’B’). Hence OP 
bisects all chords of H that are parallel to 2. 


First, map the hyperbola H onto the rectangular 
hyperbola H’ = {(x,y) : xy = 1} by some 
affine transformation ft, in such a way that f 
maps P to the point (1,1). Since the property 
of being the centre of the hyperbola is preserved 


for if A = 2, then 
(2,4, 6] = [A, 2A, 3A] = [1, 2, 3]. 


(b) This does not represent the same Point as 
[1, 2, 3], for there is no A that satisfies 


(c) This represents the same Point as [1, 2, 3], 
for if A = —1, then 


[—1, —2, —3] = [A, 2, 3A] = [1, 2, 3]. 


(d) This does not represent the same Point as 
[1, 2, 3], for there is no A that satisfies 


ll=A, 12=2)4, 13= 32. 


. In each case we multiply by the least com- 


mon multiple of the denominators (or any inte- 
ger multiple of the least common multiple) to 
obtain: 


(a) [}. +—-3| = [6,4,—1] (multiply by 8); 
(b) [0. 4, 3| = [0, 12,2] (multiply by 3) 


0 
| = [l,—2,—3] (multiply 


Appendix 3: Solutions to the Problems 


3. Dividing by the first coordinate in each, we 
obtain: 


3 5 
[-20, -30, 50] = | 1, 3,-3 


[74, 148, 0] = [1, 2,0]. 
Hence the homogeneous coordinates 


[2,3,—5], [—8,—12,20], [—20, —30,50] 


all represent the same Point. The other homo- 
geneous coordinates represent different Points. 

4. In each case we seek an equation of the form 
ax + by + cz = 0 which is satisfied by the 
homogeneous coordinates of the given pair of 
Points. 


(a) An equation for the Line through [0, 1, 0] 
and [0, 0, 1] is x = 0. 

(b) An equation for the Line through [2, 2, 3] 
and [3,3,7] is x = y. 

5. We use the strategy for determining an equa- 

tion for the Line through two given Points 
given in Subsection 3.2.2. 


(a) An equation for the Line through the 
Points [2,5, 4] and [3, 1, 7] is 


x y & 

2 5 4/=0 

3 1 7 

Now 

xyz 

=| be | 
25 4,=x -—y +z 
317 17 37 31 

= 31x —2y — 13z, 


so an equation for the Line is 


31x —2y — 13z=0. 
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(b) An equation for the Line through the 
Points [—2, —4, 5] and [3, —2, —4] is 


x y 2 
2 -4 5/=0 
a ..@ =A 
Now 
x y Zz = 
9. =a 5| =x |") 4 
¢ =? =A 
ay? 15-2 = 
P| Soa) a 


= 26x + 7y + 16z, 
so an equation for the Line is 
26x + 7y + 16z = 0. 


6. We use the strategy for deciding whether 
three Points are collinear given in Subsection 
3.2.2. 


(a) The Points [1,2,3],  [1,1,—2] and 
[2, 1, —9] are collinear if and only if 
1 2 3 
1 1 -2)=0. 
2 1 -9 
Now 
1 3 
1 1 -2)= | 5 
2 1 -9 
1 -2 1 
“ah leh 
= 1(—-9 + 2) — 2(-9 + 4) + 31 — 2) 
=—-74+10-3 
= 0. 
It follows that the three given Points are 
collinear. 
(b) The Points [1,2,—1], [2,1,0] and 
[0, —1,3] are collinear if and only if 


1 2- =] 
2 1 0) = 0. 
0 -l 3 
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7. 


8. 


Now 
1 2 -l 
2 1 ofa] | 
0 -i1 3 
2 0 2 1 

= SI-!o i 

= 1(33 — 0) — 2(6 — 0) — 1(—2 — 0) 
=3-1242 

=-70. 


It follows that the three given Points are 
not collinear. 


We have already shown that [1, 0,0], [0, 1, 0], 
{1, 1,1] are not collinear, so this leaves three 
other cases to consider. 

First we check that [1, 0, 0], [0, 0, 1], [1, 1, 1] 
are not collinear. This follows because [1, 0, 0] 
and [0,0, 1] lie on the Line y = O, whereas 
[1, 1, 1] does not. 

Next we check that [0, 1, 0], [0, 0, 1], [1, 1, 1] 
are not collinear. This follows because [0, 1, 0] 
and [0,0,1] lie on the Line x = O, whereas 
[1, 1, 1] does not. 

Finally we check that [1,0,0], [0, 1,0], 
[0, 0, 1] are not collinear. This follows because 
[1, 0,0], [0, 1, 0] lie on the Line z = 0, whereas 
[0, 0, 1] does not. 

(a) At the Point of intersection [x, y,z] of the 
two Lines, we have 


x—y—z=0, and (1) 
x+5y+2z=0. (2) 


Subtracting equation (1) from equation (2), 
we obtain 6y + 3z = 0, so z = —2y. 

Next, substituting —2y in place of z in 
equation (1), we obtain x — y+2y = 0, so 
x=-y. 

It follows that the homogeneous coor- 
dinates of the Point of intersection are 
[—y, y,—2y] (where y 4 0), which we 
may rewrite equivalently as [—1, 1, —2]. 
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(b) At the Point of intersection [x, y, z] of the 
two Lines, we have 


x+2y—z=0, and (3) 


2x + y—4z=0. (4) 


Subtracting twice equation (3) from equa- 
tion (4), we obtain —3y — 2z = 0, so 
y= —4z. 

Next, substituting —3z in place of y in 
equation (3), we obtain x — v4 —z=0,s0 
<= dz. : 

It follows that the homogeneous coor- 
dinates of the Point of intersection are 

te, —32,2 (where z ~ 0), which we 


may rewrite equivalently as [7, —2, 3]. 


. First, we find equations for the two Lines, 


using the determinant formula. 
An equation for the Line through the Points 
[1,2, —3] and [2, —1, 0] is 


x y Z 
1 2 -—3/=0 
2 -1 0 


1 2 -3 =x| 7 7 
a =f 9 
|i =] 5: ; @ 
Ylo ol *l2 -1 
= x(0 — 3) — y0+ 6) + z(—1 — 4) 
= —3x — 6y — 5z. 


Hence an equation for the Line may be writ- 
ten as 


3x + 6y +5z =0. (5) 


Next, an equation for the Line through the 
Points [1,0,—1] and [1, 1, 1] is 


x oy Zz 
1 0 -l|=0. 
1 1 1 
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10. 


11. 


Now 
x y 

0 | ' | 
1 0 -l|=x —y 

1 1 1 1 
1 1 

4 1 0 

“|1 1 


=x(0+1)—-yd+1)4+zd —0) 
=x—2y+z. 


Hence an equation for the Line may be writ- 
ten as 
x—2y+z=0. (6) 

At the Point of intersection [x, y, z] of the two 
Lines, both equations (5) and (6) hold. 

Adding three times equation (6) to equation 
(5), we obtain 6x + 8z = 0, soz = —ix. 

Next, substituting z = —ix into equation 
(6), we obtain 


2 : =0 ae 
x o= 4° » SOY = Gx. 


Hence the homogeneous coordinates for the 
Point of intersection of the two Lines are 
Xx, aX, —ix (where x 4 0), or, equivalently, 
[8, 1, —6]. 
In this particular case, the homogeneous coor- 
dinates of the Points are particularly simple, so 
we can write down equations for the two Lines 
without using determinants. 

An equation for the Line through the Points 
[1,0,0] and [0,1,0] is z = O (since this is 
of the right form, and passes through the two 
Points). 

An equation for the Line through the Points 
[0,0,1] and [1,1,1] is x = y (since this is 
of the right form, and passes through the two 
Points). 

The two Lines meet where z = 0 and x = y, 
so the homogeneous coordinates for their Point 
of intersection are [x,x,0] (where x 4 0), or, 
equivalently, [1, 1,0]. 

In Problem 10 we found that the Point of 
y is [1, 1,0]. 
Similarly, the Point of intersection of y = 0 


intersection of z = 0 and x = 
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and z = x is [x,0,x] or [1,0, 1], and the Point 
of intersection of x = 0 and y = z is [0, z, z] 
or [0, 1, 1]. 


(0, 1, 0} (0, 0,1] 


e 


2. The ideal Points for z consist of lines through 
the origin of R? that are parallel to 7. These 
are the Points that lie on the ideal Line y = 0. 


ideal Line 
y=0 


embedding 
Plane y =-1 


[2, 4, 6] 


The Euclidean point of z which corresponds 
to the Point [2, 4, 6] is that multiple of (2, 4, 6) 


which lies on the plane y = —1. That is, 
I _ I 3 
—4(2,4,6) = (-4,-1,-3). 
Section 3.3 


1. (a) The mapping 


t: [x,y,z] +> [-2y+3z,-—x+5y—z, —3x] 


can be expressed in the form [x] > [Ax], 
where 


0 —2 3 
A=]-1 > =] 
—3 0 0 
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det A = |—1 2 =] 
—3 0 O 


al 
=| 4 

= 6 

=0+2~x (-3)+3x 15 

= 39 £0, 


so A is invertible. It follows that t is a 
projective transformation, and that A is a 
matrix associated with f. 


(b) The mapping 


wm 


t: [x,y,z] [x —7y+4z,-x + 5y —z, 
x — 9y + 7z] 


can be expressed in the form [x] > [Ax], 
where 


det A 


ll 
| 
aa 
n 
| 
nm 


5 =1 
=o 7 


=f, 

1 = 
=1x26+7x(-6)+4x4 
= 0, 


II 
= 


-1 -1 
eri 


+4| 


so A is not invertible. It follows that f is not 
a projective transformation. 


(c) The mapping 


wm 


t: [x,y,z] [x -1+z,2y —47+5, 2x] 


cannot be expressed in the form [x] te 
[Ax], where A is a 3 x 3 matrix whose 


3; 
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entries are real numbers. Hence f cannot be 
a projective transformation. 


. (a) The image of the Point [1,2,—1] under ¢ is 


given by 
1 1-1 1 4 
-1-—2 1 2 =]||—6 ; 
4-3 4/ \-l —6 


that is, the Point [4, —6, —6] = [—2, 3, 3]. 
(b) The image of the Point [1, 0, 0] under ¢ is 
given by 


1 1 -l 1 1 


| 
— 
| 
N 
- 
So 
II 

| 
— 


4 =3 a) \o 


that is, the Point [1, —1, 4]. 
(c) The image of the Point [0, 1, 0] under f is 
given by 


i 1 =1\ 70 1 
af = 1 1t Pell a2 Tl, 
4 -3 4) \o =3 


that is, the Point [1, —2, —3]. 
(d) The image of the Point [0, 0, 1] under ¢ is 
given by 


i i = 6 = 
<3 Gilets va 
4-3 ai 4 


that is, the Point [—1, 1, 4]. 
(e) The image of the Point [1, 1, 1] under ¢ is 
given by 


1 @d =) fi 1 
= 2 PPT eto Te 
43 4) \4 5 


that is the Point [1, —2, 5]. 
Since the matrix A which represents the trans- 
formation in Example | in Subsection 3.3.1 
has 2 as its top left-hand entry, we obtain the 
required matrix by dividing each entry of A by 
4. This gives the matrix 


1 1 
7 O gq 
ae I _3 
q 9) 4 
foal 5 
q q 4 
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4. Matrices associated with f; and fo are 


2 1 0 5 8 0 
Aj = -1 0 1 and Ad = 3 5 0 5 
0 1 1 0 0 2 


respectively. It follows that a matrix associ- 
ated with the projective transformation f; 0 f2 is 
Aj Ad. Now 


2 1 O\ /5 8 0 
AjA,=]-1 0 1 3 <2 0 
Oo 1 iit 0 0 2 
13. 21 0 
={[-5 -8 2], 
3 5 2 


so we conclude that t; 0 fo is the transformation 
[x,y,z] h> [13x + 21ly, —5x — 8y 4+ 2z, 3x 
+5y + 2z]. 


Next, i has an associated matrix AS given 
by 


1 1-1 
Aj'=]-1 -2 2 
1 2-1 


The projective transformation f, ' is therefore 


[x, y,Z] H> [x+y—z, —x-—2y+2z,x+2y—zZ]. 


5. The equation of the Line can be written in the 
form Lx = 0, where 


Lat 2 =: 


From Problem 4 we know that ft, ' has an 
associated matrix 


1 1-1 
Ay'=[-1 -2 2}, 
1 2-1 
Ne) 
1 1-1 
LA; =(1 2 -1)[-1 -2 2 
1 2 -1 
=(-2 -5 4). 


The required image is therefore the Line 


—2x —5y+4z =0. 
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. First we consider the images under f;. The 


image of the Point [1, —1, 1] under ¢; is 


—4 -1 1 1 —2 
=3 =2 1 -1 = 0 : 
4 2 =) 1 1 


that is, the Point [—2, 0, 1]. 
Similarly, the image of the Point [1, —2, 2] 
under fj is 


—-4 -1 1 1 0 
—3 -—2 1 —2 = 3 , 
4+ 2 =] 2 —2 


that is, the Point [0, 3, —2]. 
Finally, the image of the Point [—1,2, —1] 
under fy is 


—4 -1 1 -1 1 
—3 -2 1 2 a —2 ; 
4 2 =] -1 1 


that is, the Point [1, —2, 1]. 
Next, we consider the images under f2. The 
image of the Point [1, —1, 1] under f is 


=§ <6 =2 1 =4 
3 4 7)[-1 ]]= 0 |}, 
6 OG a4 1 2 


that is, the Point with homogeneous coordinates 
[—4, 0, 2] or (equivalently) [—2, 0, 1]. 

Similarly, the image of the Point [1, —2, 2] 
under fp is 


—8 -6 -2 1 0 
—3 4. 7 —2 a 3 ; 
6 O —-4 2 —2 


that is, the Point [0, 3, —2]. 
Finally, the image of the Point [—1,2,—1] 
under f2 is 


8 =6 —2 1 =) 
4 a F 2|/= 4], 
6 © =A) \e1 2g 


that is, the Point with homogeneous coordinates 
[—2, 4, —2] or (equivalently) [1, —2, 1]. 


7. We use the strategy preceding the problem. 
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(a) Let A be the matrix (c) Let A be the matrix 
—u —3v 2w 2u v 0) 
0 2v 0}. u O 3w 
0 0 4w 0 -v -w 
We wish to choose u, v, w such that We wish to choose u, v, w such that 
—u —3uv 2w 1 1 2u 0) 0 1 3 
0 2u 0 1 |= 2 |, u 0 3w 1 |}={-1 |, 
0 0 4w 1 —5 0 -v -w 1 2 
that is, that is, 
—u—3uv+2w 1 2u+v 
2u = 2]. u+3w | =]-1 
4w —5 —U-—w 2 
It follows that w = -} and v = 1. Also, It follows that 
—u—3v+2w=1,sou=—¥%. 2u+vu =3, (1) 
Thus ut+3w=-l, (2) 
13) _3 -3 
2 iy 9p 
A=| 0 > 0). v—-—w=2. (3) 
0 0 —5 Adding equations (1) and (3) in order to 


A simpler matrix for the projective trans- eliminate v, we obtain 


formation is the matrix 2u—w=S5. (4) 
I =6 .=3 Subtracting equation (4) from twice 
2A=]| 0 4 O}. equation (2) in order to eliminate u, we 
0 0 —10 obtain 7w = —7 orw = —1. 
(b) Let A be the matrix It follows from equation (4) that u = 2, 
1 0 0 i aia equation (1) that v = —1. 
0 0 w 
oe 6 4 -1 0 
A= {2 0 -3 
We wish to choose u, v, w such that 0 1 1 
u 0 0 | 3 8. We use the strategy preceding Example 6. 
0 20m I Si Ne (a) By Problem 7(a), a matrix associated 
a } 2 with the projective transformation f, that 
that is, maps [1,0,0], [0, 1,0], [0,0, 1], [1, 1,1] to 
u 3 the Points [—1,0,0], [—3,2,0], [2,0, 4], 
w l= 4 |. [1,2, —5], respectively, is 
7) 5 13 -6 -—5 
It follows that u = 3, v = 5 and w = 4. Aj=| 0 4 9 
Thus 0 0 -10 


3 00 (b) By Problem 7(c), a matrix associated 
A=1{0 0 4]. with the projective transformation ft, that 
05 0 maps [1,0,0], [0, 1,0], [0,0, 1], [1, 1,1] to 
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the Points [2,1,0], [1,0,—1], [0,3,—1], 
[3, —1, 2], respectively, is 


4 -1 0 
Az=|{2 O -3 
0 1 1 
(c) A matrix associated with t, lis Ay’, which 
we can calculate to be 
Jt 3 1 
13 26 26 
A; =|0 § 0 
0 0 - 
However, a simpler matrix associated 
with ¢; | is 
20 30 —10 
260A;' =| 0 65 0 
0 O —26 


Hence a matrix for the projective trans- 
formation which maps [—1, 0, 0], [—3, 2, 0], 
[2,0,4], [1,2,—5] to [2,1,0], [1,0,—1], 
[0,3,—1], [3, —1,2], respectively, is given 
by 

4 -l 0 20 30 —10 
2 0 -3 0 65 0], 
0 1 1 0 O —26 


which equals 


80 55 —40 
40 60 58 
0 65 —26 


9. We use the strategies in Subsection 3.3.3. First, 
it follows from the first strategy that a matrix 
for the projective transformation ft, that maps 
[1,0,0], [0, 1,0], [0,0, 1], [1, 1, 1] to the Points 
[1,0,-—3], [1,1,—2], [3,3,-—5], [6,4,—13], 
respectively, is 


—3u —2v —Sw 
where u, v, w Satisfy 


u Vv 3w 1 6 


(j=) 
e 
Ww 
s 
ll 
a 


—3u —2v —5Sw 1 -13 


533 
that is, 
utvu+3w 6 
v+3w — 4 
—3u —2u —5w —13 


By comparing the first two rows of these 
matrices, it is clear that we must have u = 2. 
It then follows from the last two rows that 


v+3w =4, (5) 

—2u —Sw = —-7. (6) 

If we add twice equation (5) to equation (6), 
we obtain w = |. Equation (5) then gives v = 1. 


A matrix for the projective transformation 1; is 
therefore 


2 1 3 
A={ 0 1. 3 
x6 =2 <5 


Next, it follows from the first strategy that 
a matrix for the projective transformation 
that maps [1,0,0], [0,1,0], [0,0,1], [1,1, 1] 
to the Points [3,—5,3], [5,-1.0], [3,-5.6], 
[8, —13, 12], respectively, is 


3u su 3w 


—5u —v —5w], 
3u 0 6w 
where u, v, w satisfy 
3u su 33w \ (SI 8 
—5u —v_ ~—5Sw 1 = ]-13 |], 
3uSs«*0 6w 1 12 
that is, 
3u+ su +3w 8 
—5u—v—5w =] -13 
3u + 6w 12 


Equating corresponding entries in these 
matrices and multiplying them by 2,1 and 7 
respectively, we deduce that 


6u +vu+6w = 16, (7) 
5u—v—5w=—13, (8) 
u+2w=4. (9) 
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Adding equations (7) and (8), we obtain 
u+w=3. (10) 


It follows from equations (9) and (10) that 
w = 1 andu = 2. Equation (8) then gives 
v= =2, 

Hence a matrix for the projective transforma- 
tion fo 1S 


6 -l 3 
A,=]-10 2 —-5 
6 0 6 


Finally, we follow the second strategy. 
A matrix associated with t, lis 


1 


I 
7 7 0 
Ay'=|-9 4 -3], 
3 -1 1 


so a simple matrix associated with t, lis 


1-1 0 
2A;'=|-18 8 -6 
6 -2 2 


Hence a matrix for the projective transforma- 
tion that maps [1,0,—3], [1, 1,—2], [3,3,—5], 
[6,4, 13] to (3, -5,3],| },—-1,0] ,13,—5,6], 
[8, —13, 12], respectively, is given by 


6 -l 3 L =] 0 
-10 2 -5 —18 8 —6], 
6 O 6 6 —2 2 
which equals 
42 —20 12 
—76 36 —22], 
42 —18 12 


So we may take as a simple matrix associated 
with the transformation 


21 —10 6 
—38 18 —-ll1 
21 —-9 6 


Section 3.4 


1. Since the problem is concerned exclusively with 
projective properties, we shall prove the corre- 
sponding projective result. By the Fundamental 
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Theorem of Projective Geometry, we may take 
A, B,C and U to be the Points [1, 0, 0], [0, 1, 0], 
[0, 0, 1] and [1, 1, 1], respectively. 

First, we find the homogeneous coordinates 
of the Points A’, B’ and C’. 


Since the Line AU passes through [1, 0, 0] and 
[1, 1, 1] it must have equation y = z. Also, BC 
passes through [0, 1,0] and [0,0, 1] and there- 
fore has equation x = 0. It follows that AU and 
BC intersect at the Point A’ with homogeneous 
coordinates [0, y, y], that is, at [0, 1, 1]. 

Since the Line BU passes through [0, 1, 0] and 
[1, 1, 1] it must have equation z = x. Also, AC 
passes through [1,0,0] and [0,0, 1] and there- 
fore has equation y = 0. It follows that BU and 
AC intersect at the Point B’ with homogeneous 
coordinates [x, 0, x], that is, at [1,0, 1]. 

Since the Line CU passes through [0, 0, 1] and 
[1, 1, 1] it must have equation x = y. Also, AB 
passes through [1,0,0] and [0, 1,0] and there- 
fore has equation z = 0. It follows that CU and 
AB intersect at the Point C’ with homogeneous 
coordinates [x, x, 0], that is, at [1, 1, 0]. 

Next, we determine the homogeneous coordi- 
nates of the Points P, Q and R. 

The Line B’C’ has equation 
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that is, x = y +z. Thus the Lines B’C’(with 

equation x = y + z) and BC (with equation 

x = 0) meet at the Point P with homogeneous 

coordinates [0, y, —y], thatis, at P = [0, 1, —1]. 
The Line A’C’ has equation 


that is, y = x + z. Thus the Lines A’C’ (with 

equation y = x + z) and AC (with equation 

y = 0) meet at the Point Q with homogeneous 

coordinates [x, 0, —x], that is, at Q = [1,0, —1]. 
The Line A’B’ has equation 


that is, z = x + y. Thus the Lines A’B’ (with 
equation z = x + y) and AB (with equation 
z = 0) meet at the Point R with homogeneous 
coordinates [x, —x, 0], that is, at R = [1, —1, 0]. 

Finally, the Points P, Q and R are collinear if 


O 71 <1 

1 oO -1/=0 

1-1 O 

Since 
0 1 =! 
1 O -1|=0| 9 “ol O 
1-1 O 
1 O 
| 
+ ( | 4 


=0-10+1)—-1(-1—0) 
= 0, 


it follows that the Points P,Q and R are 
collinear, as required. 

. The following figure illustrates Desargues’ The- 
orem on an embedding plane z for which Q 
(the Point of intersection of AC and A’C’) is an 
ideal Point. Since P and R are collinear with 
the ideal Point Q where AC and A’C’ meet, it 
follows that PR is parallel to AC and A'C’. 
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P[1, 0, 0] 


Let P, C, Q, C’ be the Points [1,0,0], 
[0, 1, 0], [0, 0, 1], [1, 1, 1], respectively. Then the 
Line PC’ passes through the Points [1, 0, 0] and 
[1, 1, 1], and therefore has equation y = z. 

Since B is a Point on PC’, it must have homo- 
geneous coordinates of the form [a,b,b], for 
some real numbers a and b. Now, b ¥ 0, since if 
b = 0 we must have B = P. It follows that we 
may write the homogeneous coordinates of the 
Point B in the form [p, 1, 1] (where p = a/b). 

Similarly, the line QC’ has equation x = y, so 
that the homogeneous coordinates of the Point A 
are [1,1,q]. 

We now find the Point B’ where PA meets 
QC. The Line QC has equation x = 0. Since 
the Line PA passes through the Points P = 
[1,0,0] and A = [1, 1, gq] it must have equation 


— 
re OoO< 


z 
0}, 
q 


which we may rewrite as —gy + z = 0. 

It follows that at the Point B’ of intersec- 
tion of the Lines PA and QC we must have 
z = qy and x = 0, so that the Point B’ has 
homogeneous coordinates [0, 1, q]. 

Similarly, the Point A’ is the Point of intersec- 
tion of the Lines PC and QB. Since P = [1,0, 0] 
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and C = [0,1,0], the Line PC has equation 
z = 0; and since Q = [0,0, 1] and B = [p, 1, 1], 
the Line QB has equation x = py. Hence the 
Point A’ has homogeneous coordinates [p, 1, 0]. 

Since the Line AA’ passes through A = 
[1,1,q] and A’ = [p, 1, 0] it must have equation 


x y Zz 
1 1 q\|=0, 
p 1 0 


which we may rewrite as 


—qx + pqyt+ (1 — p)z=0. 


Since the Line BB’ passes through B = 
[p, 1, 1] and B’ = [0, 1, q] it must have equation 


x y Zz 
p |i 1=0, 
0 1 q 


which we may rewrite as 


(q — 1x — pqy + pz = 0. 


The Line CC’ has equation x = z, so AA’, 
BB’ and CC’ will be concurrent if the equations 


—4qx + pqy + (1 — p)z = 9, 
(q— Dx — pay + pz = 9, 
x—z=0, 

have a non-zero solution. This happens if 


-q pq 1-p 
q-1 -pq_ p 
1 0 -1 


= 0. 


But the determinant on the left is equal to 
p)+ (1 — p)(pq) 
=—py —pat+pa+pa+pa-p4 
= 0, 


qpq — pql-—4q 


so AA’, BB’ and CC’ are concurrent. 
4. The statement of the collinearity strategy (Sub- 
section 3.2.2) is follows: 


Three Points (a, b,c], [d,e, fl, [g,h, k] 


a ob. ¢ 
are collinear if and only if \d_ e f|=0. 
g h_ k 
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To make it easier to dualize this result, by break- 
ing the sentence up into shorter portions, we first 
rephrase it in the following way: 


Three Points 
la, b,c], [d,e, f], [g,h, k] 


are collinear 


abe 
if and only if |\d e f|=0. 
g h_ k 


We first interchange ‘Point’ and ‘Line’, and 
‘collinear’ and ‘concurrent’; this gives the state- 
ment: 


Three Lines 
la, b,c], [d,e, f], [g,h, k] 


are concurrent 


abe 
if and onlyif|\d e f|=0. 
g h_ k 


Next, we make the changes necessary for this 
statement to make sense; it becomes: 


Three Lines 
ax+ by+cz=0, dx+ey+fz=0, 
ex+hy+kz=0 


are concurrent 


abe 
if and onlyif|\d e f|=0. 
g h k 


This is the result dual to the collinearity strategy. 


Section 3.5 


1. (a) First, we find real numbers aw and # such 
that the following vector equation holds: 
(3,5,-5) = a(1,-1,—-1) + Bd, 3, —2). 
Comparing corresponding coordinates on 
both sides of this vector equation, we 
deduce that 


3=a+8, 
5=-a + 38, 
—5 = -a — 26. 


Appendix 3: Solutions to the Problems 


(b 


wm 


By adding the first two equations, we obtain 
8 = 46, so B = 2. The first equation then 
gives a = 1. (As acheck, notice that these 
values of a and £ satisfy the third equation 
too.) 

Next we find real numbers y and 6 such 
that the vector equation 


(1, —5, 0) = yd,—-1,-) + 6(1, 3, —2) 


holds. Comparing corresponding coordi- 
nates on both sides, we deduce that 


l=y+6, 
—5=—y + 36, 


By adding the first two equations, we obtain 
—4 = 46, sod = —1. The first equa- 
tion then gives y = 2. (As a check, notice 
that these values of y and 6 satisfy the third 
equation too.) 

It then follows from the definition of 
cross-ratio that 


ABCD) = & a oe 
uncon! /ba2/ a+ 


First, we find real numbers a and 6 such 
that the following vector equation holds: 


(—3, —5, —8) = a(1, 2,3) + B(2, 2,4). 


Comparing corresponding coordinates on 
both sides of this vector equation, we 
deduce that 


—3=a+ 26, 
—5 = 2a + 28, 
—8 =3a+ 46. 


By subtracting the first equation from the 
second, we obtain a = —2. The first equa- 
tion then gives 26 = —1, or B = —5. (As 
a check, notice that these values of a and 6 
satisfy the third equation too.) 

Next, we find real numbers y and 6 such 
that the vector equation 


(3, —3,0) = y(1, 2,3) + 8(2, 2,4) 
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holds. Comparing corresponding coordi- 
nates on both sides, we deduce that 


Bop kg 
0 = 3y +48. 


By subtracting the first equation from the 
second, we obtain y = —6. The first equa- 
tion then gives 28 = 9, or 6 = 3. (As a 
check, notice that these values of y and 6 
satisfy the third equation too.) 

It then follows from the definition of 
cross-ratio that 


BJ) =f 3 1 
ABCD)= — = , 
( ) ef: =} = 4 


. We have 
A=[1,-1,-1], B=[1,3,—2], 
C = [3,5,—5], D=[1,—5, 0]. 


To determine (BACD), we first find real num- 
bers a and #6 such that the following vector 
equation holds: 


(3,5, —-5) = a(1,3,—-2) + BU, —-1,-1). 


It follows from the result of Problem 1 (a) that 
a = 2 and 6 = 1 (essentially, the letters w and 
B in that Problem have been interchanged). 

Next, we find real numbers y and 6 such that 
the vector equation 


d, —5, 0) = y (1,3, -2) aa 6d, =, —l) 


holds. It follows from the result of Problem 1(a) 

that y = —1 and 6 = 2 (in this case, the letters y 

and 6 in that Problem have been interchanged.) 
Hence from the definition of cross-ratio, 


ee ae ee 
( a y 2/-1 4 


To determine (AC BD), we first find real num- 
bers a and #6 such that the following vector 
equation holds: 


(1,3, —2) = a1, -1,—-1) + 8,5, —S). 
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Comparing corresponding coordinates on 
both sides of this vector equation, we deduce 
that 


l=a+36, 
3=-a +58, 
—2=-a —5f. 


By adding the first two equations, we obtain 
4=88,sop= 7 The first equation then gives 
a= —5. (As a check, notice that these values 
of aw and f satisfy the third equation too.) 

Next, we find real numbers y and 6 such that 
the vector equation 


(1, —5,0) = yd, -1, -1) + 6(3,5, —5) 


holds. Comparing corresponding coordinates on 
both sides, we deduce that 


l=y +36, 
—5=—-yt+56, 
0=-y—56. 


By adding the first two equations, we obtain 
—4 = 85,sod = —3. The first equation then 
gives y = 3. (As a check, notice that these 
values of y and 6 satisfy the third equation too.) 

It follows from the definition of cross-ratio 
that 


3 1 a 
(ACBD) = B = 4 + =5. 
ce a a) ae 


3. From the solution to Problem 1(a) we know that 
(ABCD) = —4, so by Theorem 2, 


(ABDC) = 1/(-4) = -3, 
(DBCA) = 1 — (—4) =5, 
(ACBD) = 1—-(—4) =5. 
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Using the sign convention for ratios of lengths 
along a line, we obtain 


(ABCD) = a ae 
CB / DB 
3 6 
=(-1) /C8) 
=2 
and 
(DBCA) = cules Za 
CB / AB 
3 6 
=()/C) 
=-l. 


5. In each case we use Theorem 2 to bring the ideal 


Point to the front of the cross-ratio, and then 
apply equation (11). 
(a) If D is an ideal Point then 


(ABCD) = 1—(DBCA) (swap outer 
two Points) 


= (DCBA) (swap middle 
two Points) 
AC : 
= — (from equation 
BC 
(11)). 
(b) If C is an ideal Point then 
(ABCD) =1-—(ACBD) (swap middle 


two Points) 


= (DCBA) (swap outer 
two Points) 

=1/(CDBA) (swap first 
two Points) 

AD 

=1 BD (from 
equation (11)) 

_ BD 

=a 

6. Since B is an ideal Point, 
(ABCD) = os 
~ DA 
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7. 


8. 


Let A and B denote the signs, C denote the 
car, D denote the junction, and let A’, B’, C’, D’ 
be their images on the film. Then 


A’C’ A’ D' 
C’B’ 7 D'B' 
_ 3 6\ 10 
~* 3 4} 9° 
Now let the car be n km from the junction. 
Then 


(A’B'C'D') = 


(ABCD) = 2° oe 
CB] DB 
3-n 3 
aia) 
_ 26=n) 
~ 3(2—n) 


Since (ABCD) and (A’B’C'D’) must be 
equal, it follows that 


2(3 —n) 10 
32—n) 9 
Hence 
54 — 181 = 60 — 30n, 


and so 12n = 6, orn = 5 . The car is therefore 
5 km from the junction. 
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Let A and C denote the stations, B denote the 
train, and D denote the ideal Point where the 
railway lines ‘meet’, and let A’, B’,C’, D’ be 
their images on the film. Then 


AC’ A’D’' 
(ABCD )= cp / DB’ 


ll 

| 
NI 
7 

| 
QI oo 
a 


Now let the train be n km from C. Since D is 


an ideal Point, 
ABCD) = = 
( BC 
_ 50 
= 
But (ABCD) and (A’B’C’D’) must be equal, so 
50. 3 
n 2 
Hence n = ie The train is therefore 334 km 
from the next station. 


Chapter 4 
Section 4.1 


1. ZA 


(«/z', y'/z',1) 


ideal Point 


x 


Let F be the projective figure which corre- 
sponds to the parabola {(x, y,z) : y = 47 = 
1} in the standard embedding plane a. Any 
Point [x’, y’,z’] on F must pierce z at a point 


(x'/z', y'/z’, 1) on the parabola, and so 


Since z’ is non-zero, we can multiply by (z’)? 
and drop the dashes to obtain the equivalent 
equation 
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yZ= x, 2 # 0. Thus 
By dropping the constraint that z # 0, we sy = 2x} + 3x11 —yp tar +2y1 41 
can include additional ideal Points of the form {940-014-041 


[x, y, 0]; for these points we must have x7=0. 
In fact, there is just one such Point, namely 
[0, 1,0]. 59 = 2x2 + 3xoyo — ys + x2 +2 + 1 
2. (a) This — not cone a PrOsenye conic —846—-1424241 
because it includes linear terms in x and y. 


= 4, 


(b) This defines a projective conic because it = 18, 
has the form of equation (2) with A = B = x12 + x2y1 
C=G=landF=H=0. S12 = 2x32 +3( —— —— ] — yiya2 
(c) This defines a projective conic because it 
has the form of equation (2) with A = B = + (= as 2) +2 (25) +1 
G=H=0,C =land F =-1. z 2 
(d) This does not define a projective conic 443 (- + *) jj (- + *) 
because it includes a constant term. 2 2 
3. (a) This is true since O+1 
+2{——]+1 
(1-0)+ (1-0) + (0-0) =0. 
= 343 
(b) This is true since niet ae eee 
2 2 2 =k 
2-(1)° — (2)° + (1-2) + 1-0) + () 
and 
=2-—4+4+2=0. xyy+tx 
5) = 2xyx $3 (We) — yy 
(c) This is false since 
Xp +x yity 
2 2 22 _ = 2 1 
3-1°42-2°—3° =34+8-9=240. +( 5 )+ ( 5) )+ 
4. First, we rewrite the equation of the hyperbola = 2x43 y+0 04 1+x 
2u2 — 6v2 + 5v — 1 = Oin the form ~ 2 2 
5 i O+ 
2 2 y 
=3 —y=— =; pees 
u vo + 5° 5 + 2( 5 ) +1 
so that we can apply the Eccentricity Formula. = ’ ee 7 y+ 3 


Then, if we apply the Eccentricity Formula 
with C = —3, G = 3 and H = —3, it follows 
that the eccentricity e of the hyperbola is given 


2. Here we use the results of Problem 1 and 
Joachimsthal’s Section Equation soak? +2siok+ 


by the formula —" 
7 It follows that the ratio in which the hyperbola 
e? = 1—(-3) =4, divides the segment is k : 1, where 
2 _ 0): 
so that e = 2. Le es 
it follows that 
Section 4.2 9k? + 9k +2 =0, 
1. Here we use Joachimsthal’s notation, with or 
(x1, y1) = C0), (x2, y2) = (2, 1) and (Gk4 21Gb + 1h = 0, 


s = 2x7 43xy—y?+x4+2y4+1. so that k = —3 ork = —}. 
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Hence the hyperbola divides the line segment 
at two points, in the ratios —4 : 1 and 4 as 
that is, in the ratios —2:3 and —1:3. The minus 
signs indicate that the intersection points lie 
outside the actual segment. 

The minus signs indicate that the intersection 
points lie outside the actual segment. 

3. Here we use the formula s; = 0. 


(a) In this case, 
s =x? —xy+2y-7, 


so that at the point (x1, yj) = (—1, 2), 


2x — 2 
=p (FS *)42(25*) <7 


= —2x + ay = 5. 
The equation of the tangent at (—1, 2) is 
therefore 
—2x + 3y-5=0, 
or 


4x —3y+10=0. 
(b) In this case, 
s = 3x? +2xy — y? +x —2y -3, 
so that at the point (x1, yj) = C1, 1), 


sy a3-ex42(2S*) 1.5 


1+x l+y 
2 3 
+( 2 ) ( 2 ) 
aie 


It follows that the equation of the tangent 
at (1,1) is 


or 
9x —2y —7=0. 
4. We use the formula st = s- $11 for pairs of 
tangents. 
Here, 
s=4xy+1, 
so that at the point (x1, yj) = (2, 1), 


sy =4-2-14+1=9 


541 


and 


=2x+4y+1. 
It follows that the equation of the tangent pair 
iS 
(2x +4y +1)? = 4xy+1)-9. 
Multiplying this out, we obtain 
4x? + 16y?+1+4+2(8xy+2x+4y) = 36xy+9, 
so that 
x? —S5xy+4y*4+x+4+2y-2=0. (1 


Since one of the tangents has equation y = 
x — 1, it follows that (x — y — 1) must be 
a factor of equation (1). Using this fact, it is 
straightforward to check that equation (1) may 
be factorized as 


(x —y—1)(x —4y4+ 2) =0. 


Hence the equation of the other tangent from 
(2, 1) is 
x—-4y+2=0. 


. Here we use the formula s; = 0 for the polar. 


The equation of the hyperbola is s = 0, where 
s=2x74+xy—3y’?+x-6, 
so that at the point (x1, yj) = (,—1), 


2) -3-Coy 


2 


1+x 
—6 

>) 
= 7 11 


a2 Lent ( 


It follows that the equation of the polar of 
(1, —1) is 


4x+7y—-11=0. 


. Lets = y* + 27 + 2xy — 4yz 4+ zx, so that the 


equation of the projective conic is s = 0. 


(a) Ata general Point [x1, yi, z1], 
Ss. = yy +221 + (iy + xy) 
— 2(yiz + yz1) + 5 (zie + 241). 


wm 


So at the Point [1, 0,0], which lies on E, 
we have 


s} =0+0+4+(y+0)— 200+ 0) + 400+2) 
=yt 52. 


Hence the equation of the tangent to E at 
[1,0, 0] is 


yt+4z=0, 
or 
2y+z=0. 


The Point [0,1,—2] lies on this tangent, 
since 


2-1+(-2)=0. 
At a general Point [x1, y1, Z1], 
5} = yyy +221 + Gy +xy1) 
— 2(yiz + yz) + 5 (zie + 2x1) 
and 
su = yp +e + 2x11 — 4yizi + zim. 
Thus at the Point [0, 1, —2], 
Ss} =y—2z+ (04x) —2(z2 -—2y) 
+ 5(—2x +0) 
=5y-4z 
and 
sy} =1+4+4+0-4.- (—2)4+0=13. 


Now, the pair of tangents from the Point 
[0,1,—2] to E is given by the equation 
sa = 5 - S11, that is, 


(Sy —4z)* = (y? +27 +2xy —4yz+zx)-13. 

Multiplying this out, we obtain 

25y” — 40yz + 16z” = 13 y* + 13z” + 26xy 
— 52yz + 13zx, 

so that 


12y* +327 —26xy+12yz—13zx = 0. (2) 


— 
QO 
wm 
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But we know that the Point [0, 1, —2] lies 
on the tangent in part (a), so 2y + z must 
be a factor of the left-hand side of equation 
(2). (This fact serves as a useful check on 
the working so far.) Hence equation (2) can 
be expressed in the form 


(2y + z)(6y + 3z — 13x) = 0. 


It follows that the equations of the two 
tangents to the projective conic are 


2y+z=0 and 6y+3z—-— 13x =0. 


The polar of the Point [0, 1, —2] is given by 
the equation s; = 0, and we know from the 
results of part (b) that at this Point 


sy= Sy — 4z. 


Thus the equation of the polar of [0, 1, —2] 
with respect to E is 


Sy —4z2 =0. 


Section 4.3 


1. Let the projective conic have equation 


Ax? + Bxy+ Cy? + Fxz+Gyz+ Hz? =0. 


Since [1,0,0] lies on the projective conic, 
we must have A = 0. Similarly, since [0, 1, 0] 
and [0,0,1] lie on the projective conic, we 
must also have C = 0 and H = 0. Thus the 
equation of the projective conic reduces to the 
form 

Bxy + Fxz+Gyz=0. 
Since [1,1,1] and [—2,3,1] also lie on the 
projective conic, we deduce that 
B+F+G=0 (1) 
and 
—6B —2F4+3G=0. (2) 
Adding 6 times equation (1) to equation (2), we 
obtain 4F + 9G = 0, or F = —3G. Substi- 
tuting for F into equation (1), we deduce that 
B = 3G. 
It follows that the equation of the projective 


conic must be of the form 


2Gxy - 4Gxz + Gyz =0, 
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or 
Sxy —9xz+4yz =0. 
2. Let any such projective conic have equation 


Ax’ + Bay Cy’ + Fxz+Gyz+ Hz =0. 


Then, as in Problem 1, it follows from the fact 
that [1, 0, 0], [0, 1, 0] and [0, 0, 1] lie on the pro- 
jective conic, that its equation must reduce to 
the form 


Bxy + Fxz+Gyz=0. 


Since [1,2,3] also lies on the projective 
conic, we deduce that 


2B+3F +6G=0. 


Any projective conic satisfying this condition 
contains the given four Points. 

There are many such possibilities. For exam- 
ple, taking B = 3, F = 2 and G = —2 gives 
the projective conic 

3xy + 2xz — 2yz = 0; 
while the choice B = 6,F = —2,G = —1 
gives the projective conic 


6xy — 2xz— yz=0. 
3. (a) Let x’ = Ax, so that 


x! 1 2 3\ /x 
y J =]2 3 4] ]y]. 
Zz 3.4 6J \z 
Since 
2. 3 —2 1 
23 4 Oo J=]{ O07, 
3 4 6 1 0 
I 2 3 0 0 
23 4 —3 J}=]-1]., 
3 4 6 2 0 
and 
t 2 3 1 ; 
23 4 —2),= 
3 4 6 1 
it follows that the images under ¢ 
of [—2,0,1], [0,—3,2], [1,—2,1] are 


(b) 


(c 


wm 
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[1,0,0], [0,—1,0]=[0,1,0] (since the 


coordinates are homogeneous  coordi- 
nates), [0, 0, 1], respectively. 
Since 
|! .2.-°3 —2 0 1 
23 4 0 3-2 
3 4 6 1 -2 1 
1 0 0 
={0 1 O}], 
0 0 1 
—2 
it follows that 0 _— is the 
1 


1 2 3 
inverseof}2 3 4 

3 4 6 
Next, x = A7!x’ so that 


x —2 0 1 x 
y|J= 0 3-2 y' |; 
z 


thus, 
x=—2x' +2’, 
y = 3y’ — 27, 
and 
z=x'—2y'42Z. 
It follows that t maps the given projec- 


tive conic onto the projective conic with 
equation 


17(—2x' + 2’)? + 47(—2x' + z’) 
x (By! — 22’) + 32y’ — 22’)? 
+ 67(—2x' + z')(x' — 2y’ +z’) 
+92(3y' — 22')(x! — 2y' + 2’) 
+ 66(x’ — 2y’ +2’)? =0. 


After simplifying this equation (with 
much cancellation), we find that it can be 
written in the form 


2x'y' — 3y'z' — zx’ =0. 
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4. 


5. 


—1/3 0 0 
(d) Finally, let B = 0 -il 0}; 
0 oO 1/2 
then a matrix for a suitable projective 
transformation is 
-}$ 0 0\/l 2 3 
BA = O -1l1 OJ} [2 3 4 
0 0 5/\3 4 6 
-} Fa 
={-2 -3 -4 
2 2 3 
Since the projective transformation is 
unaltered if we multiply this matrix by —6, 
then another matrix associated with this 
projective transformation is 
2 4 6 
12 18 24 
-9 -12 —-18 
The projective transformation ¢ with which 
this matrix is associated maps the given 
projective conic E onto the standard pro- 
jective conic xy + yz+zx = 0. 
This case corresponds to a representation of 


Pascal’s Theorem in an embedding plane for 
which R is an ideal Point. Since P and Q are 
collinear with the ideal Point R where AB’ and 
A’B meet, it follows that PQ is parallel to AB’ 
and A’B. 

The statement of Pascal’s Theorem becomes 
the following. 

Let A,B,C, A’, B’ and C’ be six distinct 
points on a plane conic, with BC’ and B’'C 
intersecting at P, CA’ and C’A intersecting at 
Q, and AB’ parallel to A’B. Then PQ is paral- 
lel to AB’ and A’B. 


(a) If the Line with equation 91x — 60y — 


109z = O touches the projective conic 


(b 


we 


6. (a) 
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with equation x? + y* — z? = 0 at some 
Point P = [a,b,c], say, then its equa- 
tion must be ax + by — cz = 0 (or some 
multiple of this). Comparing the equations 
91x—60y—109z = 0 and ax+by—cz = 0, 
we see that P must have homogeneous 
coordinates [91, —60, 109]. Since 


917 +60" = 109° 
= 8281 + 3600 — 11881 
= 0, 


it follows that [91,—60, 109] does lie on 
the projective conic. Thus the given Line 
is a tangent to the projective conic, and the 
Point of tangency is [91, —60, 109]. 

If the Line with equation 4x +5y+3z = 0 
touches the projective conic with equation 
x* + y? — z* = 0 at some Point P = 
[a,b,c], say, then its equation must be 
ax+by—cz = 0 (or some multiple of this). 
Comparing the equations 4x + 5y + 3z = 
0 and ax + by — cz = 0, we see that 
P must have homogeneous coordinates 
[4,5, —3]. 

However, since 


4? +5? — (-3)? = 16+ 25-9 
= 32 £0, 
it follows that [4,5, —3] does not lie on the 
projective conic. Thus the given Line is not 
a tangent to the projective conic. 
As in Problem 1, the Points [1,0, 0], 
[0, 1,0] and [0,0, 1] lie on E;. The Points 


[2,2,—1] and [2,—1,2] also lie on Ey 
since 


(2)(2) + (2)(-1) + (D2) = 0 


and 


22)(-D + (-)@) + @)@) = 0. 
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(b) Since 
1 -1 0 1 1 
2 Oo} =|{0], 
1 1 2 0) 1 
1 -1 0 0 -l 
2 lj = 0 |, 
1 1 2 0) 1 
1 -1 0 0 0 
0 2 OJ} = ]{2], 
1 1 2 1 2 
1 -1 0 2 0) 
0 0 2 2 )= {1-2 |, 
1 i <2 -1 2 
and 
1 -1 0O 3 
0 0 2 -1)/=1]4], 
1 I 2 P) 


the images under ¢, of the five Points 
in part (a) are [1,0,1], [-1,0,1], 
[0,2,2] = [0, 1, 1], [0, —2,2]=[0, —1, 1] 
and [3, 4, 5], respectively. 

It is easy to verify that the five Points 
[1,0, 1], [—1,0, 1], [0, 1, 1], [0, —1, 1] and 
[3,4,5] all lie on the projective conic E2 
since their coordinates satisfy its equation 
x? + y?— 2? =0. 

By the Five Points Theorem (Subsec- 
tion 4.3.1, Theorem 1) five distinct Points 
uniquely determine a non-degenerate pro- 
jective conic. It follows from the results of 
parts (a) and (b) that the projective trans- 
formation ft; with associated matrix A = 

1 -l1 0 

0 Q 2) maps Ej onto Ep. 

1 1 2 
(d) Since 


(c 


wm 
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—2 0 
a 0 —2 O 
0 O -2 


1 0 0 
(—2);0 1 O], 
0 0 1 


a matrix associated with the projective 
transformation f2 o ft; is —2I. Hence 
another matrix associated with the projec- 
tive transformation f 0 ft is the identity 
matrix I; it follows that f2 o t; must be the 
identity projective transformation, and so 
tz 1s the inverse of f. 

Hence since B is a matrix associated 
with fo, it is a matrix associated with 
the projective transformation that maps E2 
onto EF. 


7. We saw in Problem 3, part (d), that a matrix 


associated with a projective transformation that 
maps the projective conic EF with equation 


17x? +47xy +32y* +67xz+92yz +662" = 0 


onto the projective conic E’ with equation x y+ 
yz+zx =0is 


2 4 6 
A, =] 12 18 24), 
-9 -12 —-18 


and in Problem 6, part (c), that a matrix associ- 
ated with a projective transformation that maps 
E’ onto the projective conic E” with equation 
x?+y*—27 =0is 


1 -1 0 
As=]0 O 2 
1 1 2 


It follows that a matrix associated with a pro- 
jective transformation that maps the projective 
conic E onto the projective conic E” is 


1 -1l O p 4 6 


A2Ay = | 0 0 2 12 18 24 
1 1 2 -—9 -12 —-18 

—10 —-14 —18 

= |-18 -—24 —36 


-—4 —<¥ <6 
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Since the projective transformation is unal- 
tered if we multiply the matrix by the non-zero 


factor -3, we conclude that a matrix associ- 
ated with this projective transformation is 
fl 5 7 9 
— zA2Al ={/9 12 18 
2 1 3 


. The Point [1,2v2, 3] lies on the projective 


conic with equation xe a = 2°: first, we 
express its homogeneous coordinates in the 
— [ier 2¢ : 
form [x,y,1] = [55. te? | as described 
before Theorem 9 in Subsection 4.3.4. Since 


[1.2v2, 3| — [}. ee 1], we have 


2/2 

1+? 3 142003 
From the first equation we have 3 — 37 = 
1+17, so that 2 = 4r? ort = +(1/V2). Since, 
from the second equation above, 2t/(1 + t?) 
is positive, it follows that we must have ¢ 
positive. Hence, at the Point P,t = 1//2. 
Remark It is important not to simply assert 
that 


(ga ee a 


1-r 1 3 2t 
a 


2/2 = 2 


(No such ¢ exists!) The coordinates are homo- 
geneous coordinates, so that it is their relative 
ratio that matters not the value of any particular 
coordinate. 


and 3=14+?2°. 


P (1-17, 2, 1 +27] 


(a) By Joachimsthal’s theory, the equation of 
the tangent at a Point [x1, y1, zi] on E is 


XxX, + yyy — 22, = 0. 


(b) 


(c) 
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Hence the equations of the tangents 


at A[0,1,1], B[1,0,—1], C[0,1,—1], 
D{1,0, 1] and P[1 — t?,2r, 1+ 27] are 
y-—z=0, 
x+z=0, 
y+z=0, 
x—-z=0 


and 
(1 —1*)x + 2ty —(1+17)z =0, 


respectively. 

At the Point A’ we have that y — z = 0 
and (1—77)x + 2ty — (1+ 27)z = 0. 
It follows that y = z and (1 — eye _ 
(1—2t+1?)z = 0. So either ¢ = 1 (so 
that P = [0,2,2] = [0,1,1] = A, which 
we can ignore) or (1+f)x — (1 —1f)z = 0. 
Hence the homogeneous coordinates of A’ 
are [1 —t,1+¢,1+f¢]. 

At the Point B’ we have that x + z = 0 
and (1 — 1?) x + 2ty —(1+1*)z = 0. It 
follows that x = —z and 2ty — 2z = 0. 
Hence the homogeneous coordinates of B’ 
are [t, —1, —f]. 

At the Point C’ we have that y + z = 0 
and (1—77)x + 2ty — (1+ 27)z = 0. 


It follows that y = —z and (1-17) x — 
(1+2r+27)z = 0. So either t = —1 
(so that P = [0,—2,2] = [0,—1,1] = 


C, which we can ignore) or (1 — t)x — 
(1 + t)z = 0. Hence the homogeneous 
coordinates of C’ are [1 +¢,—-1++¢,1—f]. 
At the Point D’ we have that x — z = 0 
and (1—1?)x + 2ty —(1+17)z = 0; 
thus, x = z and 2ty — 21?z = 0. So 
either t = O (so that P = [1,0,1] = D, 
which we can ignore) or y — tz = 0. 
Hence the homogeneous coordinates of D’ 
are [1,t, 1]. 
We now find the cross-ratio (A’B’CD’), 
using the definition of cross-ratio in Sub- 
section 3.5.1. 
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Firstly, we have to find real numbers 
a and # such that the following vector 
equation holds: 


Geyel2er=% 


Sel= ses ps ee ew, 


Comparing corresponding coordinates on 
both sides, we deduce that we must have 


1+t=a(i—ft)+ ft, (3) 
—1l+r=a(1+t)—B, (4) 

and 
1—t=a(1+f) — ft. (5) 


Adding equation (3) and equation (5) we 
get that 2 = 2a, so that a = 1; substi- 
tuting for a into equation (4) we get that 
B=14+t4+1-t=2. 

Next, we have to find real numbers y 
and 6 such that the following vector equa- 
tion holds: 


(i290 G1 =), 


Comparing corresponding coordinates on 
both sides, we deduce that we must have 


l=y(—?)+ 6, (6) 

t=y(1+t)—6 (7) 
and 

l=y(1+1)—6t. (8) 


Adding equation (6) and equation (8) we 
get that 2 = 2y, so that y = 1; substi- 
tuting for y into equation (7) we get that 
6=-t+14+r= 1. 

It then follows from the definition of 
cross-ratio that 


(A'B'c'p’y = BY 
ad 


261 


=, 
= 2. 


ee 
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Section 4.4 


1. In each case we use the definition of associated 
matrix given before the problem. 


(a) Here a suitable matrix is 


1 
1 -= -l 

2 
1 3 
as 3 a 
2 2 
3 1 
-] = = 
2 2 

(b) Here a suitable matrix is 

1 3 
Pes ee 
2, 2 
1 1 
ap - 24 = 
2 2 

3 1 
— = 4 

2 2 


2. We follow the method in the proof of Theorem 2 

of Section 4.4. 
(a) A matrix associated with the projective 
conic with equation x? — 4xy + 2y* — 4yz+ 


3z7 = Ois 
i =—2 06 
Ant 2 ye 2 
0 2 3 


(b) We start by diagonalizing the matrix A. Its 
characteristic equation is 


0 = det (A — Al) 


| 
=~ 
— 

| 
> 
wm 


+2] +0 


= (=a) (22-5442) +2(-6 +24) 


S23) + 67 = 35 = 10 
=-(A—S5)\A—2)A+ 1), 


so that the eigenvalues of A are 5, 2 and —1. 
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The eigenvector equations of A are 


(1 —A)x —2y =0, 
—2x+(2-—A)y—2z=0, 
—2y+(3-A)z=0. 


When A = 5, these equations become 


—4x —2y =0, 
—2x —3y —2z =0, 
—2y —2z=0. 


From the first equation, we have y = 
—2x; and from the third equation, we have 


zZ=~—y. So we may take as a corresponding 
1 
eigenvector | —2 |, which we normalise 
2 
1/3 
to have unit length as | —2/3 
2/3 
When A = 2, these equations become 
—x —2y=0, 
—2x —2z=0, 
—2y+z=0. 
From the first equation, we have x = —2y; 


and from the third equation, we have z = 
2y. So we may take as a corresponding 


—2 
eigenvector 1 , which we normalise 
—2/3 
to have unit length as 1/3 
2/3 
When A = —1, these equations become 
2x —2y =0, 
—2x+3y —2z=0, 
—2y+4z=0. 


From the first equation, we have y = x; and 
from the third equation, we have y = 2z. 


(c 


Near 
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So we may take as a corresponding eigen- 


2 
vector | 2 |, which we normalise to have 
1 
2/3 
unit length as | 2/3 
1/3 
1/3 —2/3 2/3 
If we then take P=} —2/3 1/3 2/3 
2/3, 2/3 1/3 


we have 


5 0 0 
P7AP=[0 2 O |, 
00 -1 


where P is an orthogonal matrix. 

Thus the transformation of coordinates 
given by x = Px’ or x’ = P*x trans- 
forms the projective conic with equation 
x? — Axy + 2y* — Ayz + 327 = 0 (that is, 
x’ Ax = 0) onto the projective conic with 
equation 


5(x’)? + 2(y')? — (z')* =0 


(that is, (Px’)’ A(Px’) = 0 or (x’)? (P7 AP) 
x’ = 0). 

Next, the transformation of coordinates 
x’ +> x” given by x” = Bx’, where 


G5 0 © 
B=| 0 v2 0], 
Oo © 1 


transforms the equation of the projective 
conic into the equation (x”)* + (y”)? — 
(2)? = 0. 

Thus, after dropping the dashes, it fol- 
lows that we can map E onto the projec- 
tive conic with equation x7 + y? = 2? 


> 
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by the projective transformation [x] ++ Chapter5 


[BP’ x], where 


vs Oo O 
BP’ = 0 2 0 
0 oO 1 


is <6 23 

o1 975. 13 28 
2/3 2/3 1/3 
J5/3 —2/5/3 2/5 /3 


—2/2/3  J2/3 22/3 
2/3 2/3 1/3 


This is a matrix associated with the desired 
projective transformation. 


Section 4.5 


1. 


The statement of the Five Points Theorem (Sub- 
section 4.3.1, Theorem 1) is follows: 

There is a unique non-degenerate projective 
conic through any given set of five Points, no 
three of which are collinear. 

To make it easier to dualize this result, by 
breaking the sentence up into shorter portions, 
we first rephrase it in the following way: 

Given any five Points, 
no three of which are collinear, 

there is a unique non-degenerate projective 

conic through the five Points. 

We first interchange ‘Point’ and ‘Line’, and 
‘collinear’ and ‘concurrent’; this gives the state- 
ment: 

Given any five Lines, 
no three of which are concurrent, 

there is a unique non-degenerate projective 

conic through the five Lines. 

Next, we make the changes necessary for this 
statement to make sense; it becomes: 

Given any five Lines, 
no three of which are concurrent, 

there is a unique non-degenerate projective 

conic that is tangential to the five Lines. 

This is the result dual to the Five Points 
Theorem. 


Section 5.1 


1. The centre of inversion is the origin (0, 0) and 


the radius of the circle of inversion is |. In each 

case, the point and its inverse lie on the same 

half-line from O to infinity. 

(a) Since 4 x i = |, and since (4,0) and 
(4.0) both lie on the positive x-axis, it 
follows that (3. 0) is the inverse of (4,0). 

(b) Since | x 1 = 1, it follows that (0, 1) is its 
own inverse. 

(c) Since \-4] x |—3] = 1, and since (0. -) 
and (0,—3) both lie on the negative y- 
axis, it follows that (0, —3) is the inverse 
of (0. -t). 

(d) Since i x 4 = 1, and since (4.0) and 
(4,0) both lie on the positive x-axis, it 
follows that (4, 0) is the inverse of (} 0). 


. We may use the formula for inversion in @ 


given in Theorem 2. 
(a) The image under inversion in @ of the 
point (4, 1) is 


4 1 41 
i eae =(; 4). 


7? 
is 
1 
2 


OO OO 


(b) Similarly, the image of ( -} 


ll 
tn| oo 
| 
Us 
eeu 


. Let € be the line with equation y = 2x 


punctured at the origin. By the strategy above 
Example 2, the image of @ under inversion in 
@ is the curve whose equation is 
y 2x 
x24 y2 x24 y2" 
We may rewrite this equation as y = 2x. Just 
as the origin had to be excluded from the line 
before we could find its image, so the origin 
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4. 


5. 


has to be excluded from the image. It follows 

that the image of @ under the inversion is @ 

itself (punctured at the origin in each case). 

(a) It follows from the strategy above Exam- 
ple 2 that the image under inversion in @ 
of the line £ with equation x + y = | is the 
curve whose equation is 

x yy 
ety | x24 y2 : 


We may rewrite this equation in the form 


xty=x7+y’ 


By completing the square, we may write 
this as 


is iy 4 4 
(*-3) +(¢-3) =2 @ 


This is the equation of a circle that passes 
through the origin, so the required image 


is the circle with centre ( 3) and radius 


wo punctured at the origin. 
The points (1,0) and (0, 1) lie on £, since 
their coordinates satisfy the equation of @. 
Since the two points lie on @, they are 
unchanged under inversion with respect to 
@. Hence they both also lie on the image 
of ¢ under the inversion. 
The image of @ is the circle with equa- 
tion (1), punctured at the origin. From part 
(b) this image passes through the points 
with coordinates (1,0) and (0,1). It fol- 
lows that the image is uniquely determined 
by the three points (0,0), (1,0) and (0, 1), 
as shown below. 


(b 


wm 


(c 


a 


image of € 


The circle with centre (2,2) and radius 1 has 
equation (x — 2)* + (y — 2)? = 1’, which we 
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may rewrite in the form 
x? 4 y?—4x —4y +7=0. 


Using the strategy above Example 2, we 
deduce that the image of this circle under 
inversion in @ has equation 


2 2 
cee ee ees ah 
(=) (a) ee a 


a 
x2 + y2 So 


We may add together the first pair of terms and 
the second pair of terms to obtain the equation 


x2 + y? 


We may rearrange this in the form 
14x -4y +7(x? +y’) = 0, 


or 


xe ge a, 
eee ee ee, oie. 


By completing the squares, we obtain 


aye A 2\° 1 
ae a) aes 
This is the equation of a circle with centre 
(5. ;) and radius i. 
yA 


image 
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6. The circle with centre (0.-4) and radius 7 


‘i iy 
has equation x* + (v + 1) = (4) , which 
we may rewrite in the form 


1 
x+y +sy=0. 


Using the strategy above Example 2, we 
deduce that the image of this circle under 
inversion in @ has equation 


‘ 2 i 2 ly 
2 
a =0. 
(si) ‘a TP4yy 


We may add together the first pair of terms to 
obtain the equation 


which we may rewrite as 
y= -2. 


This is the equation of a horizontal line with 
y-intercept —2. 


image 


of C 


It is clear from the figure that each point of 
this line is the image of some point on the 
punctured circle C — {O}. 


7. (a) From the summary box before Example 5, 


we know that € maps to a circle punctured 
at the origin. This circle passes through the 
point (0, 1) since this point is fixed by the 
inversion. Since £ is symmetrical about the 
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y-axis, it follows that the image circle must 
also be symmetrical about the y-axis. The 
only circle that fulfils all these criteria is 


the circle with radius 5 and centre (0. 3). 


(0,1) gL 


(0,0) 


(b 


ma 


From the summary box before Example 5, 
since the circle C passes through the origin 
we know that the image of C is a line that 
does not pass through the origin. It must 
be symmetrical about the y-axis (because 
C is), and it must pass through the point 
(0. 3) (the image of (0, 2)). The only line 
that fulfils all these criteria is the line y = 


ak 


8. We follow the proof of Theorem 5 as far as 


possible, making the necessary modifications 
to the line that passes through the centre of 
inversion O. 

Let £1 be the line that passes through O, and 
let the second line £2 meet ¢; at A. 

Under the inversion, €; — {O} maps to itself, 
and £2 maps to some circle C2, punctured at O. 
Then C2 meets £; at O and at the point A’, the 
image of A under the inversion. 
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By the Symmetry Lemma, £2 is parallel to 
the tangent m2 to C2 at O. It follows that the 
angle from £1 to £2 must be equal in magnitude 
and direction to the angle we have shown from 
£; to m2 at O. 

Next observe that the reflection in the per- 
pendicular bisector of OA’ maps ¢ to itself 
and C> to itself, and sends the tangent mz at O 
to the tangent 72 at A’. Since the reflection pre- 
serves the magnitude of an angle but changes 
its orientation, we conclude that the angle from 
£, to nz at A’ must be equal in magnitude but 
opposite in orientation to the angle from £1 to 
m7 at O. 

Overall, we have shown that the angle from 

£, to C2 at A’ must be equal in magnitude but 
opposite in orientation to the angle from £; to 
fy at A. This completes the proof. 
By the conclusion to Example 6, the image 
under inversion of each punctured circle touch- 
ing the x-axis is a line parallel to the x-axis 
(that is, a horizontal line). 

Similarly, the punctured circles that touch 
the y-axis map to lines parallel to the y-axis. 

It follows that, under the inversion, the 
circles map to horizontal and vertical lines. 
Indeed, the only horizontal and vertical lines 
that are not images of circles are the x- and 
y-axes. 


ev 
*Y 
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10. Let € be the line through O and A. Under inver- 
sion in the unit circle, £ — {0} maps to itself. In 
particular, the point A on @ maps to some point 
A’ on €. Since all the circles pass through A 
and are punctured at O, their images under the 
inversion must be straight lines that intersect 
at A’. 


11. 


In fact, apart from @, every straight line 
through A’ must be the image of one of the cir- 
cles in the family. For, as shown below, if a line 
£; is at an angle @ from @, then, by the Angle 
Theorem, it must be the image of the (punc- 
tured) circle Cj that is at an angle —@ from @. 


NL 
N 


Since the circles C; and C2 pass through the 
origin, it follows from Theorem 4, part (b), that 
their images under inversion are two lines, 1 
and £2. Next observe that the common tangent 
to C; and C2 at the origin maps to itself. Since 
this tangent and the circles Cy and C2 meet 
only at the origin, it follows that their images 
do not meet in C. In other words, €; and £2 are 
both parallel to the tangent. 
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Let A and B have images A’ and B’, respec- 
tively, under the inversion. Then it follows 
from Theorem 4, part (a), that the image of C3 
under inversion is a circle C4. Since A and B 
lie on C3, A’ and B’ must lie on Cy 

Since angles are preserved (in magnitude) 
under inversion, the images of tangential 
curves (which may be considered as curves 
intersecting at a zero angle) are also tangential 
curves. Hence C4 must be tangential to ¢ at 
A’ and to € at B’. 


Section 5.2 
1. (a) With z} = 2 — 3i and z2 = —3+ 4i, we 


have the following. 


G@) z1+z2=—I+i. 

Gi) 2] —Z2 =5-Ti. 

(iii) zpz2 = (2 — 3i)(—3 + 4i). 
= -64+ 8+ 91+ 12 
=6+ 17i. 


: 21 — 2-31 
(iv) Here DT DHA 


Multiplying the numerator and 
denominator by the complex conju- 
gate of —3 + 47, we obtain 

21 (2 — 3i)(—3 — 41) 


zz. (—3+4i)(—3 — 4i) 


_ =6=— 81+ 91 = 12 
32 4 42 
= 55(-18 +i). 
(v) 7 =24+3i. 
(vi) Z2 = —3 —4i. 
(b) Here 


lzi| = ¥2? + (-3) = V13 


and 


Izo] = V(—3)2 +42 = V25 =5. 


2. We can write z} = r(cos@ + isin@) where 


r=lal|=Vv124(-12 = v2 
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and 


— tL ing=—+ 

cosé = Fi and sind = Ta 

so that the principal argument is 0 = —7/4. 
Hence the required polar form of z} = | —i is 


J/2(cos(—1/4) + i sin(—7/4)). 


Similarly, we can write z2 = r(cos@ +i sin@) 
where 


r= lal =(-v3) +P =v4=2 


and 


cos@ = = and sind = 7 

so that the principal argument is 6 = 5z7/6. 
Hence the required polar form of z2 = —/3+ 
i is 


2(cos(5z/6) + i sin(5z/6)). 


. From Problem 2 we know that 


z1 = V2 (cos (2) +isin(-4)) 


and 


(a) Using the strategy of multiplying moduli 
and adding arguments, we obtain 
mz Sa 
= 2/2 ae eae 
2122 (cos ( ri + 6 ) 
+ isi TU i 5 
isin{| —— + — 
4+ 6 
a] 7 
=2V2 (cos (4) +isin (4)) : 
(b) Using the strategy of dividing moduli and 


subtracting arguments, we obtain 


Z1 af x 5n 
= COs 
22 2 4 6 
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4, (a 


(b 


(c 


) 


wm 


Nee 


where in the last line we have added 27 to 
the argument to obtain its principal value. 


The coefficient of z has modulus | — i| = 
1. By Theorem 1, it follows that ¢ is an 
isometry. 

In the formula that defines t, the multi- 


plication by —i corresponds to a rotation 
about the origin through the angle —z/2, 
and the addition of 6 — 4i corresponds to a 
translation through the vector (6, —4). 


n/2 


6 
The rotation through —z/2 corresponds to 


the composite r2 o r; where 


r1 is the reflection in the x-axis, 
r2 is the reflection in the line y = —x. 


(Here y = —x is the line through the origin 
that makes an angle —7:/4 with the x-axis.) 


The translation through the vector 
(6, —4) corresponds to the composite r4or3 
where 


r3 1s the reflection in the line 
6x —4y = 0, 

rg is the reflection in the line 
6x — 4y = 26. 


(Here 6x — 4y = 0 is the equation of the 
line through the origin that is perpendicu- 
lar to the vector (6, —4), and 6x — 4y = 26 
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is the equation of the parallel line that 
passes through the midpoint (3, —2) of the 
position vector of (6, —4).) 


Overall, t = rgor307r2 ofr}. 


5. Here we use the formula for inversion of points 
in @ given by Theorem 3. 


(a) The image of —J/3 +i under inversion in 


@ is 
i. 4 
—/34i  —V3-i 
—VJ/3 +i 
“0840 


= }(-v3 +i). 
(b) The image of —3 — 47 under inversion in 
6 is 
1 2 1 
3-4 —-34+4i 

= —3-4i 

~ (-3 + 4i)(-3 — 47) 

= —75(3 + 4i). 


6. By Theorem 3, with r = 2 and c = 0, the 
inversion is given by 


t(z) = : (ze C—{O}). 


NX 


7. (a) Here we use part (a) of the definition of 


inversion. By Problem 6, the inversion can 
be written in the form 

+, ifzeC— {0}, 
if z= 0, 


0, ifz=o. 


t(z) = 4 w, 
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(b) In this case we use part (b) of the def- 
inition of inversion. Under reflection in 
the real axis, a point z € C maps to Z. 
The inversion can therefore be written in 


the form 
z ifzeCc, 
t(z) = : 
oo, ifz=o. 
8. Here 
:, if¢eC—{O}, 
n(z)=40, ifz=0, 
0, ifz=w, 
and 
:, if¢eC—{O}, 
n(z)= 40, ifz=oo, 
oo, ifz=—0. 
Hence 


(Co) = f2 0 t1 (00) = (0) = on, 
t(0) = f2 01, (0) = h(cwo) = 0. 
For the remaining values of z € C — {O}, we 
have 
4 
=== =4 
(1/Z) 
It follows that t = fo 0 fy is the function 


4z, ifzeC, 
t(zZ) = 
oo, ifz=o. 


1 
tzj=hon()=h (<) 
z 


This fixes the point at infinity, and scales 
elements of C by the factor 4. 

If the circle of radius 2 were replaced by 
a circle of radius Vk, then tf would fix the 
point at infinity and scale elements of C by the 
factor k. 
. In addition to mapping oo to oo, the trans- 
formation ft scales the complex plane by the 
factor | — 9| = 9, rotates it through the angle 
Arg(—9) = mz, and then translates it through 
the vector (6, —10). 
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By the last part of Problem 8, the scaling 
by the factor 9 can be decomposed into the 
composite fz o t; where 


t; is the inversion in the unit circle @, 
ta is the inversion in the circle of radius 3 
centred at the origin. 


ay, 


The rotation through the angle z can be 
decomposed into the composite t4 0 t3 where 


tz 1s the inversion in the extended x-axis, 
t4 1s the inversion in the extended y-axis. 


(The y-axis makes an angle 2/2 with the 
X-axis.) 


The translation through the vector (6, —10) 
can be decomposed into the composite f6 0 f5 
where 


ts is the inversion in the extended line €5 U 
{oo}, where @s is the line 6x — 10y = 0, 

te is the inversion in the extended line 6 U 
{oo}, where £6 is the line 6x — 10y = 68. 


(Note that €5 is the line through the origin 
that is perpendicular to the vector (6, —10), 
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and f¢ is the parallel line that passes through 
(3, —5).) 


6x—10y =0 


6x —10y = 68 


@ (6, -10) 


Since #6, t5, 4,3 and ft2 o t; all map & to 
itself, it follows that tf = t60 fs ot4o 20 f of. 
10. We use the following formulas from the proof 
of Theorem 6 of Subsection 5.2.4: 
2x 2y 
ee ee ee 
oy? =] 
= 4 year 


(a 


wm 


Substituting for X, Y and Z from the above 
formulas, we get that the image under 
stereographic projection onto € of the cir- 
cle on S with equation X = 5 is the set in 
the plane for which 
2x 1 

x2+y24+1 be 
that is, 

Ax =x*+y*4+1 
or 

(x — 2)? + y? =3. 
Thus, the image under the stereographic 
projection of the circle on S is the circle in 
the plane with centre (2,0) and radius J/3. 
Substituting for X, Y and Z from the above 
formulas, we get that the image under 
stereographic projection onto € of the cir- 
cle on S with equation 3X + 2Y + Z=1 
is the set in the extended plane for which 


(b 


wm 


6x ee 4y 
ao le a ae | 

Ce a cee | = 

x2+y24+1 7 
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that is, 
6x +4yt2x27+y?-l=x?4+y?4+1 


or 
3x+2y—-1=0. 


Thus, the image under the stereographic 
projection of the circle on S is the extended 
line in the extended plane through the 


points (0. 5) and ( 0). dt is the 
extended line in C since N maps onto oo.) 
Section 5.3 
1. In each case we have to determine whether the 
formula has the form 
az+b 
MZ) = ; 
cz +d 


where a,b,c,d € Candad — bc #0. 

(a) This is a MGbius transformation. It has the 
form (1), with a = 0,b = 5,c = 1 and 
d = 0. Also, 


ad—bec=-5 #0. 


(1) 


By the convention, M(oo) = a/c = 0. 
This is a MGbius transformation. It has the 
form (1), with a = —1,b = 2i,c = 3 and 
d = —4i. Also, 


ad — bc = 4i — 61 = —21 £0. 


(b 


wm 


By the convention, M(oo) = a/c = —%. 
This is not a Mobius transformation. The 
transformation can be expressed in the form 

32° +i 
aS z+0 ° 
and so clearly cannot be expressed in the 
required form (1). 
This is a Mobius transformation. The trans- 
formation can be expressed in the form 
woe ar _ Z+ oe 

Z+2i Z+2i 

This has the form (1), with a = 
5+ 2i,c = 1 andd = 21. Also, 


ad — be = 21 — (5+ 21) = —5 £0. 


(c 


Nee 


(d 


wN 


By the convention, M(oo) = a/c = 1. 
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2. (a) All matrices associated with M, are non- 
zero multiples of 


(09) 


Comparing the zero entries of A with 
those of Aj, Ao and A3, it is clear that A> is 
the only matrix of the three that could be a 
multiple of A. 

In fact, Ag = —iA, so the matrix Ao is 
associated with M). 

All matrices associated with M> are non- 
zero multiples of 


0 2i 
he ( ak 

Comparing the zero entries of A with 
those of A;, Az and Az, it is clear that Ay 
is the only matrix of the three that could be 
a multiple of A. 

In fact, Ay = aA: so the matrix Ay is 
associated with M>. 


All matrices associated with M3 are non- 
zero multiples of 


(b 


wm 


(c 


wm 


Since each of the matrices A;, Az and A3 
has at least one zero entry, whereas A has no 
zero entries, it is clear that none of A;, Ao 
and A3 is a multiple of A. 

It follows that none of the matrices 
Ay, Ao and Az is associated with M3. 

3. Matrices associated with the M,; and M) in 
Example 2 are 


i 1 1 i 
Aj => (; 3) and Ad => (; J) 7 


respectively. So 


. ey: oe 
Aida= (5 5) (; = 
_fi+i 2 
~\ 2 242i)° 


This is a matrix associated with M, o Mp. 
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4. As in Example 3, matrices associated with the 
Mobius transformations M, and M) are 


3 1 2i 3 
a= (j 5) and a= (7 2) 


(a) It follows that a matrix associated with M20 
M, is 


me ata A 

adr = (7 3) (; 5) 
_( 9% 642i 

~ \3-—2i 5 , 


9iz-—6+2i 
3—2i)z+5 
(b) A matrix associated with M; o M, is 


. Hf i 
adi =(; 5) (; 3) 


so that 


Mz o M,(z) = 


so that 

(9+i)z+1 

iz+(4+i)’ 

5. Matrices associated with the Mébius transfor- 
mations M, and M) are 


1 -i 2 ot 
A\ => (; ;) and Ad => « i} 


A matrix associated with M, 0 M2 is therefore 


ma=(7 9)(4 )=( 0): 


so that 


M,oM\(z) = 


M, o Mo(z) = z. 


6. Here we use the formula for the inverse of a 
Mobius transformation given in Theorem 7. 


_ —3iz-2  3iz+2 
M (2) = = 
ON ST ape eat 
4 4i i 
(b) M, Oe 
—4iz 4iz 
M3'(z) = = ; 
NS Sea aA 
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7. (a) To keep the calculations simple, we pick the 


e8 


points oo, 27 and 2 on the extended line, as 
shown below. Now 


t(oo) =1, 142i) =0 


and 
(2) = =4%=5-3i. 
So the image of the extended line is the 
generalized circle that passes through the 
points 1, 0 and 5 = si. This is the circle of 


radius 5 with centre ' 


2 


(b) In this case, we pick the points —2,0 and 


—1-+i on the circle, as shown below. Now 
t(-2)=o, ¢t(0)=-i 


and 
(je ees 
t(-l+i)= ivi =—-l. 
So the image of the circle is the gener- 
alized circle that passes through the points 
oo,—i and —1. This is the extended line 


with slope —1 through the point —1. 


-l+i 


Section 5.4 
1. We use the preceding strategy. 


(a) The required MObius transformation is of 


the form 


(b 


(c 


(d 


wm 


Nee 


wm 
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for some complex number K. 
Since M(—3) = 1, we must have 


so that K = 3. It follows that the required 
Mobius transformation is given by 


ACs 


Ms) = 2z 


The required Mobius transformation is of 
the form 


3 

ae) 
M(z) = K —., 
Z=1 


for some complex number K. 
Since M(2) = 1, we must have 


so that K = 2. It follows that the required 
MoObius transformation is given by 


z-3 22-3 


C=? 4 


z-1- 
The required Mobius transformation is of 
the form 
M@=— 
= oo) 
for some complex number K. 
Since M(—3) = 1, we must have 


K 


i= ——, 
= 


so that K = —5. It follows that the required 
Mobius transformation is given by 
5 5 


Mey =— 
@) z—2 2-Z 


The required Mobius transformation is of 
the form 


M(z)=K (2-3), 


for some complex number K. 
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Since M(2) = 1, we must have 
3 
1=K(2-3)=3K, 


so that K = 2. It follows that the required 
Mobius transformation is given by 


M() =2(z- 3) =22-3. 


2. We use the strategy preceding Example 2. 


(a) The Mobius transformation M, that maps 
—1,i,1 to 0,1,00, respectively, is of the 
form 

2=(-D 241 


M —K =K ; 
1@) z—-1 z—1 


for some complex number K. Since 


M\(i) = 1, we must have 
i+] 

eg allen 

i-1 


so that 


i=l @-NCiF) _ 2 
gil GLH) 2 


=i. 


It follows that 


z+1 


M,(z) =i : 
z-1 


The solution to Problem 1, part (a), shows 
that the Mobius transformation that maps 
—1,—3,0 to 0,1, 00, respectively, is given 
by 
3(z + 1) 

2z 


Now by Theorem 7 of Subsection 5.3.6, 


M2(z) = 


3 


M>'(z) = . 
2 @) 2z—3 


Now matrices associated with Mj, and 


—1 
M, are 


(b 


wm 
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so that a matrix associated with M, ‘om 1 iS 
0) 3 i i 
2 -3/\1 -l 


7 3 3 
~ \-342) 342i) 


Hence the required Mobius transformation 
is given by 
M(z) = M5! 0 Mi(z) 

_ 3z.=3 

~ (-3 + 2i)z + (342i) 
The Mobius transformation M, that maps 
3,00, —2 to 0, 1,00, respectively, is of the 
form 


z—3 
z+2) 
The Mobius transformation M2 that maps 


M,(z) = 


33 i, 1 to 0, 1, ov, respectively, is of the form 
£9 
M2(z) = K—,, 
z—l1 


for some complex number K. Since M2 (2) 
= 1, we must have 


1-3 1 
l= Ke =-—-K, 
gol 2 
so that K = —2. It follows that 
z—-3 -—2z+6 
Mo(z) = —2 = ' 
z—1 Z= 1 
Using the formula for inverses we have 
= —z-6 7+6 
M5'(z) = = ‘ 
2 @) =f 2 g2 


Now matrices associated with M, and 


= 
M, are 


1 =3 1 6 
(1 2) + (i 3): 


so that a matrix associated with M, ‘om 1 


is 
1 6\/f1 -3 7 9 
(; ( =( ‘7 
Hence the required Mobius transformation 
is given by 
7z+9 


M(z) = My! 0 M\(z) = : 
(z) > 0 Mi(z) api 
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3. We use the preceding strategy. so that 


(a) First we determine the Mobius transforma- 2i 


wm 


tion M that maps 0,—4,—2i to 0,1,0, 
respectively. Following the usual strategy, 
we observe that this transformation must be 
of the form 


z—0 
M(z) = K ——, 
@) z+2i 


for some complex number K. Since 
M(—4) = 1, we must have 


-4 


f=. — a 
—4+42i 


so that K = 1— zi. Thus the transformation 
M is given by 

7 (1-4i)< 

"ego 


It follows that 


( 2 si) (—1—3i) 


M(-1-—3i)= 
-l-i 

_=o-9 _ 3 

= pr ee 


Since this is a real number, it follows that 
the four points 0,—4, —2i, -1 — 37 lie on 
a generalized circle. In fact, they lie on a 
circle because M(oo) = 1 — zi is not real. 
First we determine the Mobius transfor- 
mation M that maps —1,—i,i to 0,1,«, 
respectively. 

Following the usual strategy, we observe 
that this transformation must be of the form 

ieee ea 


Y ile | 


for some complex number K. Since 
M(—i) = 1, we must have 
-i+1 
—2i 


1=K 


> 


Thus the transformation M is given by 


_ %e@+)) 
Ne eaiiipa dy 
It follows that 
MQ2—-i)= es 
(i — 2 — 2i) 
_86=) 1, . 
=e ge 


Since this is not a real number, it follows 
that the four points —1,—i,7,2 — i do not 
lie on a generalized circle, and so do not lie 
on a circle. 


Section 5.5 
. Let P be a point (x,y) in the plane whose 


distance from the point (0, —1) is k times its dis- 
tance from the point (0,2). Then, if we use the 
Euclidean formula for distance between points 
in the plane, it follows that 


x? + (y+ 1? =P (x? + (y — 2)”). 


For each value of k, this yields an equation for 
the corresponding Apollonian circle. The point 
(1, 1) lies on the Apollonian circle for which 


P+04+)? =? + (0-2), 


. 5 
that is, k = nee 


The equation of the Apollonian circle through 
the point (1, 1) is therefore 


P+ (y+ = $(x?+(-2), 
which simplifies to 
—3x? - By? + 12y-—9=0, 


or 


x? + y?—8y+6=0. 
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We use the definition of circles in an Apollo- 
nian family. Let the family have point circles 
A(—a,0) and B(b, 0); we take these to lie in the 
open intervals (—18,—2) and (3,12), respec- 
tively. 

Since the points (—18,0) and (—2,0) lie on 
the same circle in the family, the ratios of their 
distances from the points A and B are equal. It 
follows that 

18-—a a-2 


b+18  b+2’ 


so that 


(18 — a)(b + 2) = (a — 2)(b + 18); 
after multiplying this out and simplifying, we 
get 
2ab + 20a — 20b — 72 = 0. (1) 
Next, since the points (3,0) and (12, 0) lie on 
the same circle in the family, the ratios of their 


distances from the points A and B are equal. It 
follows that 


3+4 _ 12+ a 


so that 
(3 + a)\(12 — b) = (12 + a)(b — 3); 
after multiplying out and simplifying, we get 
2ab — 15a+ 15b—72=0. (2) 
We now solve equations (1) and (2) for a and 


b. If we subtract equation (2) from equation (1), 
we get 


35a — 35b = 0; 
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in other words, b = a. Substituting for b into 
equation (1), we get that 2a” — 72 = 0; so that 
a= +6. 

Since —a € (—18, —2), by assumption, it fol- 
lows that a = 6. Finally, since b = a, we have 
that b = 6. It follows that the point circles in the 
Apollonian family are (—6, 0) and (6,0). 

Then, using the Euclidean formula for dis- 
tance between points in the plane and the def- 
inition of an Apollonian circle, we deduce that 
the general form of equations of circles in this 
Apollonian family is 


(x+6)?+y? =? (@-6? +5’). 
The point (6, 9) lies on that circle for which 
(6+6+9% =k (6-6) +9"), 


from which we get that k> = 25/9. The equation 
of this particular Apollonian circle is therefore 


(+6 +y? = B (67 +"), 


which after multiplying out and simplifying 
becomes 


4x? + 4y* — 102x + 144=0. 


Remark 

Another method of determining this last equa- 
tion is as follows. Let P be the point (6,9). Then 
PA* = (6+ 6)? + 9? = 225, so that PA = 15, 
and PB* = (6 — 6)? + 9” = 81, so that PB = 9. 


BP? = (6—6)° +97 =81, 


so that BP = 9. 

Then the ratio PA:PB equals 15:9, or 3:1. 
Hence, if we use the Euclidean formula for 
distance between points in the plane and the 
definition of an Apollonian circle, we deduce 
that the equation of the circle in this Apollonian 
family through the point (6, 9) is 


(«+62 +y? = F(@-67 +)’). 
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CLA T x 


First, we verify that there exists a circle, Cy,4 
say, with centre A that intersects C, at right 
angles. 

Let the two tangents from A to C, touch C, 
at the points U and V, say. Then the circle Cy, 
with centre A and radius AU intersects C,, at 
right angles since AU and AV are tangential to 
Cn and are radii of Cy, 4. 


Now invert the figure in Theorem 10 in the 
circle Cy,4. This inversion maps C,, to itself, by 
Corollary 3 of Subsection 5.5.3, since C,,4 and 
C,, intersect at right angles. 

Since S; and S» are circles tangential to Cy, 
their images under inversion are generalized cir- 
cles tangential to the image of C,, - which is just 
C,, itself. Further, since S; and Sj pass through 
the centre A of the inversion, their images must 
pass through oo. Hence the images of S; and S 
must be extended lines tangential to Cy. 

Now, the inversion maps the positive x-axis 
to itself; and, since S; and Sz meet the pos- 
itive x-axis at right angles, their images must 
also meet the positive x-axis at right angles. It 
follows that the images of S$; and Sz must be 
vertical half-lines that are tangential to C;. 

The other circles in the chain also invert to 
circles that are tangential to these two verti- 
cal half-lines, and each successive circle in the 
image also touches the previous and following 
circles in the chain. 


4. 
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Similarly, the image of $3 is the semicircle in 
the upper half-plane with diameter C’ B’. 

Since all the circles in the image of the chain 
and the semicircle image of 53 touch both verti- 
cal lines, they must all have the same diameter — 
namely, the diameter of C,,. It then follows that 
the height of the centre of C,, above the x-axis 


= radius of image of 53 
+(n—1)- diameter of C, + radius of C, 
=n - diameter of Cy, 


as required. 


F'($e.0) 


The points F (50.0) and F’ (56 0) are the 
centres of the circles Sj and S2, respectively. 

Let S be any circle in the chain constructed 
in Theorem 10, and U its centre. Let V be the 
point of contact of $; and S. Since Sj and S have 
a common tangent at V and their radii FV and 
UV are perpendicular to this tangent, the points 
F, V and U are collinear. 

Next, let W be the point of contact of S2 and 
S. Since Sz and S have a common tangent at 
W and their radii F’W and UW are perpendicu- 
lar to this tangent, the points F’, U and W are 
collinear. 

It follows that 


FU + F'U = (radius of S; + radius of S) 
+ (radius of S2 — radius of S$) 
= radius of $; + radius of $2 
=4(b+c). 


Hence (by Theorem 5 of Subsection 1.1.4) U 
lies on (the upper half of) an ellipse whose sum 
of focal distances is 5(b +c). 
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From the figure, it is clear that one end 
of the major axis of this ellipse must be the 
point A(0,0). Since (by Theorem 5 of Subsec- 
tion 1.1.4) the length of the major axis of the 
ellipse is the sum of the focal distances, it fol- 
lows that the other end of the major axis must 


be the point (50 +c), 0), as required. 


Chapter 6 
Section 6.1 
1. 


From the figure, 2; and £3 are d-lines; £2 is not. 
(€; and £3 meet @ at angles; £2 does not.) 
2; 


A 


Intersections (in Y) : €; and £2, 2; and £4. 
Parallel: €; and €3. 
Ultra-parallel: 22 and £3, &2 and £4, €3 and £4. 
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C 


One solution is shown. The d-lines ¢; and £2 are 
parallel because they (are parts of generalized 
circles that) have a common boundary point, 
A. The d-lines €2 and £3 are parallel because 
they (are parts of generalized circles that) have a 
common boundary point, B. But ¢; and €3 meet, 
at O, and so are not parallel. 


. By the Origin Lemma, there are hyperbolic 


transformations 7; and r2, say, that map A; and 
A2, respectively, to the origin O. Since Gg is 
a group, it follows that the composite mapping 
r=ry ‘or, is alsoa hyperbolic transformation; 
moreover, r maps A, to Ao, as required. 


. We use the Origin Lemma to obtain a hyperbolic 


reflection r that maps the point P to the origin, 
O, and the d-line £ to some d-line ¢’. Suppose 
that the boundary points of @ are A and B, and 
the boundary points of ¢’ are A’ and B’, where 
r(A) = A’ and r(B) = B’. Then the diameters 
from A’ and B’ are the only two d-lines paral- 
lel to ¢’ through O. The reflection r~! sends O 
to P, and A’ and B’ to A and B, the boundary 
points of £, so it sends the diameters from A’ and 
B’ to d-lines through P that are parallel to €. 


To see that the images under r~! of these 
diameters are the only such d-lines, suppose that 
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£” is a d-line through P and A, and so parallel 
to £. Then r(€”) is a d-line parallel to ¢’ through 
O and so must be the unique diameter of F 
through A’. It follows that there is a unique d- 
line through P and A, as required. (Similarly, 
there is a unique d-line through P and B.) 


Section 6.2 
1. Reflection in the d-line obtained from a is 
given by 
az—1l 
pZ)= =~, 
Z-@ 


so p(z) = O if and only if z = 1I/a, that is, if 
and only if z = 1 /a. 

Let a point A € Y be represented by the com- 
plex number f. Then it follows from the above 
result that the reflection in the d-line obtained 
from z = 1/8 maps A to the origin. (As a check, 
we note that this reflection is given by 


_ G/p)z-1 
p1(Z) = —— 
z—(1/B) 
and that 
1/B)B —1 
pi(B) = aiid = 0.) 
B-(1/B) 


Thus we have proved the Origin Lemma. 
. Since M(z) = gett is a Mobius transformation, 


we have aa—bb = |a|?—|b|? 4 0. If M(0) = 0, 
then (since a and b cannot both be zero) 


a-0+b=0, 


which implies that b = 0. 

. Because the direct hyperbolic transformations 
we seek map 0 to 0, they must be (Euclidean) 
rotations about 0. So they are of the form 


M(z) = Kz, 


where K = cos 6+i sin6, 6 being the anticlock- 
wise angle of rotation. 

Now the equation y = x/+/3 can be rewrit- 
ten in the form y = x tan Z, and the equation 
y = 3x can be rewritten as y = x tan 7 
So the two rotations we seek must be through 
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angles of 2/6 and —5z/6. Thus M(z) = Kz, 
where 


K =cost +isint =}(v3+i) 
or 


K =cos (—3£) +isin(—) = —4(/3+i). 


. It follows from Theorem 4 that the direct hyper- 


bolic transformations with these properties are 


as follows. 
(a) M(z) =K aa = K$51, where|K| = 1. 
ge e—(- 1/3421 /3) 
(b) M@) = K p37 z = 
where |K| = 1. 


3¢-+1—2i 
3+(-2i)2’ 


. The general direct hyperbolic transformation M@ 


which maps ; to 0 is 


3 
z—3 
M(z)=K ~ (by Theorem 4) 
1— 4z 
Kz- 2K 
7 —fc+1) 


Hence, by equation (2) in Subsection 6.2.2, the 
general form of the inverse of M is 


z+4K 


M"'(2) = 5 
qZtkK 


where |K| = 1. 


. From Theorem 4, the direct hyperbolic transfor- 


mation M is necessarily of the form 


=m 
= (1) 


va 
M(z)=K 
1—mz 


where |K| = l andme JY. 

Now, it follows from equation (1) that 
M(m) = 0; hence, since M maps (—1, 1) one— 
one onto itself, the number m must be real and 
in (—1, 1). Thus 


z— 
M(z)=K ; 
1—mz 


(2) 


where |K| = | andm e€ (—1, 1). 
Next, it follows from equation (2) that 
M(0O) = —Km. Since M(0) and m are real, it 
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follows that K is real; so K = +1. Thus M 
must be of the required form 
zZ—-m 
M(z)=+ , Wwherem é€ (—1,1). 
l—mz 


7. (a) We follow the strategy in Subsection 6.2.2. 
First, it follows from Theorem 4 that 

the general form of the direct hyperbolic 
transformation M, which maps —ti to 0 is 


z+4i 


1: 
1— 3iz 


M(z)=K 


’ 


where |K | = 1; a matrix associated with M, 


K lik 
ar=( 1: 3! ). 
—3l 1 


Also, by Theorem 4, the direct hyperbolic 
transformation 


= 
M2(z) = 


maps 2 to 0; a matrix associated with M2 is 


1 —2 
-} 1 


Now the inverse of Ap is 


Thus a matrix for the required hyperbolic 
transformation is 


1 2 K tik 
At'A 9 3 3 
anal DCG) 


II 
a) 


It follows that the general form of the 
required direct hyperbolic transformation is 


(x = 3i) zt (fix 4: 3) 
(3K —ti)2+(fiK +1) 


where |K| = 1. 


M(z) = ? (1) 
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(b) Now M(i) = 1if 
(x - gi)i+ (4iK+ 3) 


a (3K - si) i+ (Ji +1) 


This holds if we choose K such that Ki = 1; 
that is, if K =1/i= — i. It follows from 
equation (1) that the transformation we 
require is given by 


CH) 
9— lliz 
~ 11 — 9iz 


. We follow the strategy in Subsection 6.2.2. 


First, it follows from Theorem 4 that the gen- 
eral form of the direct hyperbolic transformation 
M, which maps 5 to 0 is 


gaol 
M\(z) = K—~., 
1—5z 


where | K| = 1; a matrix associated with M, is 


Then, it follows from the above discussion that a 
particular direct hyperbolic transformation that 
maps 5 to 0 is 


z- 
M2(z) = i 
1 ore 


and a matrix associated with Mp? is 


tf: ak 
-; 1 


Now the inverse of Ag is 


1 
= ae 
2 


NI- 
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Thus a matrix for the required hyperbolic trans- 
formation is 


= 1 14\(K -1K 
andl (4 1") 
2 2 


1 1 1 
(i ba 
3\1 1 1 

3K -—5 —qkK+1 


It follows that the general form of the required 
direct hyperbolic transformation is 


M(z) = 
($k -4)2+(-3« +1) 
_ (4K —1)z4+2(-K +1) 
~ 24K -Dzt+(-K +4)’ 
where | K| = 1. 


. (a) A direct hyperbolic transformation mapping 
si to 0 is of the form 


22-1 
Q+iz 


z—4i 
M(z)=K =k 
1+ 5iZ 


where |K| = 1. 
Now M(0) = —}Ki, so 


M(0)=5¢K =i. 


Hence 
2z-i 
Q+iz 


M(z) =i 


(b 


we 


A direct hyperbolic transformation mapping 
to 0 is of the form 


1 
£5 2¢ = 1 
M@)=K-—~=K 
1-22 2—Z 
where |K| = 1. 
acs 
Now M (3) = K3-3 = 4K, so 
=3 
MG) =3¢K =2. 


But K must have modulus 1, so no such 
direct hyperbolic transformation exists. 
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(c) A direct hyperbolic transformation mapping 
5(1 +i) to 0 is of the form 


2-30 +i) _ 3e- +i) 


M(z)=K = oy 
1-41 -ai)z 3=(l=2z 
where |K| = 1. 
Now 
1-i-(d+i 
Metaae Os 
“ a= (bar 
—6i 
=K ? 
9+ 2i 
so 
—6i 1 
Mid-i)=1sKkK == 
abo Das Saag 5 
9+ 2i 
K= . 
ad —12i 
But |K|? = area # 1, so no such direct 


hyperbolic transformation exists. 
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if 


We use the formula for the hyperbolic distance 
d(0, z). Thus 


d (0, ‘) = tanh! (3) 
~ 0.3466 
and 
d(0.8, 0.9) = d(0, 0.9) — d(0, 0.8) 
= tanh! 0.9 — tanh! 0.8 


~ 1.4722 — 1.0986 
= 0.3736. 


. We use the formula for the hyperbolic distance 


d(0, z). Thus, if d(0,a) = 1.6 anda > 0, then 


a = tanh 1.6 ~ 0.9217; 


and, if d(0,a) = 3.2 anda > 0, then 


a = tanh 3.2 ~ 0.9967. 
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3. Let m be the hyperbolic midpoint of [0.5, 0.8]. 


Now 
d(0,0.5) = tanh7! 0.5 ~ 0.5493 
and 


d(0,0.8) = tanh! 0.8 ~ 1.0986. 


Since m lies half-way between 0.5 and 0.8, 
measured in terms of hyperbolic distances, and 
0.5, m and 0.8 all lie on the same side of the 
origin, it follows that 


d(0,m) = +(d(0,0.5) + d(0, 0.8)) 
~ £(0.5493 + 1.0986) 
= 0.823 95, 


SO 


m ~ tanh 0.823 95 
~ 0.6772. 


Next, let m be the hyperbolic midpoint of 
[—0.2, 0.8]. Now, 


d(0, —0.2) = tanh! 0.2 ~ 0.2027 
and 


d(0,0.8) = tanh! 0.8 ~ 1.0986. 


a ®) 


0.2 0 m 0.8 


Clearly m lies between —0.2 and 0.8, on the 
same side of the origin as does 0.8. It follows 
that 


d(0,m) = 4(d(0,0.8) — d(0, —0.2)) 
~ $(1.0986 — 0.2027) 
= 0.447 95, 
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so 


m ~ tanh 0.447 95 
~ 0.4202. 


We use the first strategy in Subsection 6.3.3. 


1. The line through 0 and the hyperbolic centre, 
-i, of C meets C at a and b, where a and b 
are real numbers with a < b. Now 


d(0 ; tanh7! : 0.2554 
,—-— ] = tan —)~70. 
4 4 


Since a lies to the left of the origin and 
d(0,a) ~ 0.2554 + 0.5 ~ 0.7554, 
it follows that 
a =~ —tanh 0.7554 ~ —0.6384. 


Next, since the hyperbolic radius of C is 
greater than the hyperbolic distance of its 
centre from the origin, we have that b > 0. 
Thus 


d(0,b) ~ 0.5 — 0.2554 = 0.2446, 
so that 
b = tanh 0.2446 ~ 0.2398. 


2. The Euclidean centre of C is the Euclidean 
midpoint of [a,b], namely the point 


5 (0.2398 — 0.6384) = —0.1993. 
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3. The Euclidean radius of C is 


5 |a — b| = 4 |-0.6384 — 0.2398| 
= 0.4391. 


We use the second strategy in Subsection 


6.3.3. 


The Euclidean radius p is the point ip and 
the Euclidean radius is 7 so Op meets K at the 


points a = -j and b = 3. Thus 


d(0,a) = tanh7! (|-3)) 
4 


= tanh7! (0.25) ~ 0.255... 


and 


d(0,b) = tanh! (|-31) 


= tanh7!(0.75) ~ 0.973... 


Since a and b lie on opposite sides of O, the 


hyperbolic centre m of K is given by 


d(0,m) = 50. b) — d(0,a)) ~ 0.359, 


so that 


hyperbolic centre m ~ tanh 0.359 ~ 0.344; 


hyperbolic radius = 5|(d(0, a)+d(0, b)| ~ 0.614. 
6. (a) Using the Reflection Lemma with p = 0.8 


and q = 0.5, we find that 


_ 034+08-0.5-03 
= §.64 = O05 


wm 


~ 1.0769. 
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So the equation of the d-line ¢ which is 
the (hyperbolic) perpendicular bisector of 
[0.5, 0.8] is 

x? 4+ y? —2ax+1=0, (1) 
where a ~ 1.0769. 
We use the fact that angles and hyper- 
bolic lengths are unaltered under hyperbolic 
transformations; in particular, under reflec- 
tions in diameters of the unit disc and under 
rotations of the unit disc about the origin. 

The segment [—0.8, —0.5] is obtained by 
reflecting the segment [0.5, 0.8] in the diam- 
eter (—i,i), and so we can obtain its perpen- 
dicular bisector by also reflecting the d-line 
£ in the diameter (—i, 1). 

It then follows from equation (1) that the 
perpendicular bisector of [—0.8, —0.5] has 
equation 


x? +y?+4+2ax+1=0, 


where a ~ 1.0769. 

Next, the segment [0.5/, 0.87] is obtained 
from the segment [0.5, 0.8] by rotating the 
unit disc through an angle 2/2 anticlock- 
wise about the origin; and so we can obtain 
its perpendicular bisector by rotating the d- 
line £ through an angle of z /2 anticlockwise 
about the origin, that is, by applying the 
mapping (x, y) +> (—y,x). 

It then follows from equation (1) that 
the perpendicular bisector of [0.5i, 0.87] has 
equation 


x? 4+ y? —2by +1=0, 
where b ~ 1.0769. 


By the Origin Lemma (Subsection 6.1.2, 
Lemma 2), we may apply a preliminary 
hyperbolic transformation M to map A to 
O; this preserves d-lines and angles. 
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Then the image of the d-line ¢ under M 
is a diameter of Y, and it is obvious that 
there are exactly two diameters of Y that 
make an angle 6 with the diameter M(¢). By 
then applying the hyperbolic transformation 
M~—! (which preserves d-lines and angles), 
we deduce that there are exactly two d-lines 
through A = M~!(Q) that make an angle 6 


with @. 
(b) 
B 
(1 
——— 
ee 
uM 
D o 


By the Origin Lemma (Subsection 6.1.2, 
Lemma 2), we may apply a preliminary 
hyperbolic transformation M to map A to 
O; this preserves d-lines and angles. Let 
B’ = M(B) andC' = M(C). 

The images under M of the d-lines that 
contain the d-line segments AB and AC are 
diameters of Y, and it is obvious that there 
is a unique diameter ¢’ of that bisects the 
angle 7 B’OC’. By then applying the hyper- 
bolic transformation M~! (which preserves 
d-lines and angles), we deduce that there is 
a unique d-line £ = M —!(€’) that bisects the 
angle ZBAC. 

Clearly reflection in ¢’ of the diameters of 
Q that contain OB’ and OC’ map these two 
diameters onto each other. By then apply- 
ing the hyperbolic transformation M~!, we 
deduce that reflection in @ maps the d-lines 
containing BA and CA onto each other. 
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1. Let AABC be any d-triangle. Denote the angles 
of the d-triangle by a, 6 and y, respectively, as 
shown, and let 6 denote the external angle of the 
d-triangle at A. Thus, since the angle between 
two d-lines is equal to the angle between their 
tangents (see the definition of the angle between 
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two curves in Y given at the end of Subsection 
6.1.1), wehavea+0=7. 


Now, by Theorem | in Subsection 6.4.1 for 
the d-triangle AABC, we must have 


a+Bt+y<q, 


so that 
a+Bry <até; 
it follows that 6 + y < 0, as required. 


. From Problem 1, 


y>pt+y’. 
Hence 


a+Bty>a+p+pt+y. 


. Let D be a point on the d-line segment BC such 


that the d-line segment AD is part of the d- 
line that bisects ZBAC. (This bisector exists, by 
Problem 7 of Subsection 6.3.4.) 


C 


d(A, B) = d(A, C) 
B 
A 


Reflection in this d-line exchanges the d-lines 
BA and CA, so it maps B to a point B’ some- 
where on CA. Since reflection preserves length, 


we have 

d(A, B) = d(A, B’). 
But 

d(A, B) =d(A,C), 
so 


d(A, B’) = d(A,C), 


and therefore B’ and C coincide. 
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Hence the reflection maps the d-triangle 
AABC onto the d-triangle AACB and exchanges 
the angles at B and C. Since reflection pre- 
serves angles, it follows that these angles must 
be equal, as required. 

. Let AABC and APQR be any two doubly 
asymptotic triangles, with the vertices in F 
being at A and P, respectively. 

Suppose, first, that the angles at A and P 
are equal. Then there is hyperbolic transforma- 
tion that maps AABC onto a triangle Aj, say, 
with A going to the origin O; since hyperbolic 
transformations do not alter the size of angles, 
the angle of A; at O equals the original angle 
A. Since there is no hyperbolic transformation 
that maps points of @ to points of Y, or vice- 
versa, A, is also a doubly asymptotic triangle. 
The triangle AABC is therefore d-congruent 
to Aj. 


Similarly, the triangle AP QR is d-congruent 
to a doubly asymptotic triangle Az with one 
vertex at O. 

Then there is a rotation of Y round the ori- 
gin (or a reflection in a diameter followed by 
a rotation) that maps the radial sides of Aj 
and A> onto each other; and, since there is 
only one d-line that ends at two given points 
of Y, this mapping must map the two curved 
sides of A; and A> onto each other. Finally, 
since A, and A> are d-congruent, it follows that 
the triangles AABC and APQR must also be 
d-congruent. 

Suppose, next, that the two doubly asymp- 
totic triangles AABC and APQR are d- 
congruent. By mapping A to O and P to O (as 
above), and rotating suitably, it follows from the 
d-congruence that the angles at A and P must 
be equal. 
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Remark 

There is no hyperbolic transformation that maps 
points of @ to points of Y, or vice-versa; hence 
a d-triangle can only be d-congruent to a dou- 
bly asymptotic triangle if it itself is a doubly 
asymptotic triangle. 


The two d-triangles AO B’ D’ and AOC’ D' 
have side OD’, which lies along a diameter, in 
common. Let r be the hyperbolic reflection in 
that diameter. 

Since the angles ZOD'B’ and ZOD'C’ are 
equal (both are 7/2), the image under the reflec- 
tion r of the d-line segment D/C’ will lie along 
the d-line segment D’B’. Also, since the angles 
ZB'OD' and ZC'OD' are equal (OD’ is the 
bisector of the angle 7B’ OC’), the image under 
the reflection r of side OC’ will lie along side 
OB’. It follows that B’ must reflect onto C’. 
Hence the d-triangles AO B’D’ and AOC'D' 
are d-congruent. 


C"; 


O BY 


It follows from the argument in the proof of 
Theorem 7, with 6 = 7/2, that the required 
upper bound must be the length of the d-line 
segment OD". 

Let m be the d-line that ends at B” = | and 
C” = i. Now m is part of the Euclidean circle 
with centre C at the point 1 +7 and radius 1. The 
Euclidean distance OC is afd so the Euclidean 
distance OD" is /2 — 1. 
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OD’= OC - CD” 
D” =\2-1 


1 
O 


It follows that the hyperbolic length of the 
altitude OD” is 


tanh7! (v2 = 1) ~ 0.44069. 


So the hyperbolic length of altitude AD is less 
than 0.4407. 


Ne 
Sz, 


It follows from the Distance Formula for 
hyperbolic lengths that the hyperbolic lengths of 
the two sides of the triangle which meet at right 
angles at the origin are 


d (0. + i)/2V2) =d (0. l1 + il/2V2) 


=4(0) 
= tanh! 0.5 


~ 0.5493 
and 
a (0. Gre i)/5V2) - (0. |—-1+ il/5V2) 
= d(0,0.2) 
= tanh! 0.2 
~ 0.2027. 


Then, if c is the hyperbolic length of the 
third side (the ‘hypotenuse’), it follows from 
Pythagoras’ Theorem that 


cosh 2c ~ cosh (2 x 0.5493) 
x cosh (2 x 0.2027) 
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= cosh (1.0986) x cosh (0.4054) 
~ 1.6667 x 1.0833 
~ 1.8055. 


Hence the required hyperbolic length is 


8. (a) 


(b) 


c ~ 4 cosh! (1.8055) 
~ 0.5983. 


3 
z 


It follows from the Distance Formula 
for hyperbolic lengths that the hyperbolic 
lengths of the two sides of the triangle 
which meet at right angles at the origin are 


d (0. 3) and d (0. ii). These two are equal 
to the common value 


d (0, ;) = tanh~! (0.75) ~ 0.9730. 


Then, if c is the hyperbolic length of the 
third side (the ‘hypotenuse’), it follows 
from Pythagoras’ Theorem that 


cosh 2c ~ cosh? (2 x 0.9730) 
= cosh*(1.9460) 
~ 12.755. 
Hence the required hyperbolic length is 
c = 5 cosh”! (12.755) 
~ 1.6188. 


0 r 


It follows from the Distance Formula that 
the hyperbolic lengths of the two sides of 


the triangle which meet at right angles at the 
origin are equal to the common value 


d(0,r) = tanh7! r. 


Then, if c is the hyperbolic length of the 
third side (the ‘hypotenuse’), it follows 
from Pythagoras’ Theorem that 


cosh 2c = cosh* 2d (0, r) 
= cosh? (2 tanh! r) : 


Hence the required hyperbolic length is 


C= 5 cosh7! (cosh? (2tanh! r)). 


It follows from Lobachevskii’s Formula that 


tanh (24 (0. _ 4)) 
: 33 
sinh (2d (0, — 3i)) 
tanh(2 tanh~! 0.8) 
sinh(2 tanh~! 0.6) 
__ tanh(2.1972) 

~~ sinh(1.3863) 

_, 0.9756 


~~ 1.875 
~ 0.5203, 


tan A = 


so that 


A ~ tan7! (0.5203) 
~ 0.4798 radians. 


10. 


11. 


Appendix 3: Solutions to the Problems 


We use the Angle of Parallelism Formula. 
The hyperbolic distance of the point 3 from 
the d-line @ with equation x = 0 is 


p =d (0,3) = tanh! 0.75 
~ 0.9730. 


It follows from the Formula that the required 
angle @ is such that 


1 


sinh (2d (0, })) 
1 
~ Sinh(2 x 0.9730) 


= 0.2917, 


tan g = 


so that 


gy ~ 0.2838 radians. 


Using the hint, we apply the Sine Formula to the 
d-triangle ADBA, whose angles are 


ZADB ==, ZABD=6, LDAB= 46, 
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and in which side DB has hyperbolic length 5a 
and side AB has hyperbolic length a. Thus 


Let AABC be a doubly asymptotic d- 
triangle in Y. First, we construct a tre- 
bly asymptotic d-triangle AABD as follows: 
extend the segment AC beyond C to meet 
@ at D. Then draw the trebly asymptotic 
d-triangle AABD. 

Now, the d-line containing the points A, 
C, D cuts off a minor arc of @ that con- 
tains B. The d-line joining B and D lies 
on the opposite side of the segment BC 
from A. 

Hence AABC is contained in the trebly 
asymptotic d-triangle AABD. 


(4 sinh a 1 
sin (59) =- = : 
sinh 2a 2 cosh a 


1 
a = cosh! i : 
2 sin 70 


which is a function of 6, as required. 


Hence 


Remark 

This answer is very plausible when a, the length of (b) 

the sides, is very small, for then sin (40) lems 5 and a 

so 0 ~ 2/3, which agrees closely with the value of E) 

the angle in a Euclidean equilateral triangle. C -) 
V 


It is also very plausible when a is very large, for 


then sin (59) is very small and so 6 ~ 0, which is 


consistent with the fact that a large d-triangle has 
small angles. 


Section 6.5 


1. 


Let the triangle be a trebly asymptotic d- 
triangle, and let £ be one of its sides. Then as 
a point P slides along the d-line @, the angle a 
varies from z to zero. In particular, there must 
be a point on ¢ for which aw = 7/2, as required. 


2.(a) 


A 


(» 


B 


_ Sf 
B’ Ry 
c 
Let AABC be a doubly asymptotic d-triangle in 
J, with A being the vertex of the d-triangle that 
does not lie on @; and let a = ZBAC. 

Then there is a M6bius transformation M that 
maps Y to itself, with @ mapping to @, that 
maps A to the origin O. Let M map B, C onto 
B’', C’, respectively. Then M maps the dou- 
bly asymptotic d-triangle AABC onto another 
doubly asymptotic d-triangle, AOB'C’; and, 
since Mobius transformations preserve angles, 
we have that 7B’OC' = a. 

Any other doubly asymptotic d-triangle 
whose angle at its vertex in Z is a can also 
be mapped by a suitable Mébius transformation 
onto another doubly asymptotic d-triangle that 
has its angle at the origin also of magnitude a; 
and then onto the previous doubly asymptotic d- 
triangle AOB'C’ by a further rotation — which is 
again a Mobius transformation. 

So every doubly asymptotic d-triangle whose 
angle at its vertex in Y is a is d-congruent to 
the d-triangle AOB’C’, and hence has the same 
area as AOB'C’. 
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Hence the area of a doubly asymptotic d- 
triangle depends only on the angle at its vertex 
in J, 


Chapter 7 
Section 7.1 


1. 


(a) Such a rotation of S” exists, by the result of 
Example 2. 
(b) 


Rotation about the axis through the points 

(= oe 4) aad (-+, ane -4;) 

through an angle of 2 as shown, maps 

(1, 0,0) to (0, 1,0) and (0, 1,0) to (0,0, 1). 
(c) 


(0, 0, 1) 


(0, -1, 0) (0, 1, 0) 


d, 0, 0) 


The line joining the points (1,0,0) and 
(0,0,1) is a quarter of a circumference 
in length, whereas the line joining the 
points (0,—1,0) and (0,1,0) is a half- 
circumference in length. Hence there is no 
rotation that maps (1,0,0) bk (0,-—1,0) 
and (0,0, 1) +> (0, 1,0). 


. First, let 
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(cos¢siné,singsinO,cosé) = 


(0, - e 3); then, we must have 


cos @ sind = 0, (1) 
. ay 
sing sind = —+>, (2) 
and 
_ 1 
cos@ = 5. (3) 
From equation (3) we have 0 = 5 It follows 
from equation (1) that cos@é = O, and from 
equation (2) that sing - ae = aN so that 


* 2 _ 3 
sing = —1; hence ¢ = =. 
Next, let (cos@sin@,sing@sin@,cos@) = 
1 2 3 
(ee 


) ; then, we must have 


cos @ sin@ = wire (4) 
sing sind = wire (5) 

and 
cos 6 = ware (6) 


From equation (6) we have 6 = cos~! (+) jo 


cos~!(0.8018) ~ 0.64 radians. In particular, 
we have that sin@ > O, so that sn@0d = 


2 
{= (+) = Ja It follows from equa- 


tion (4) that cos ¢@ = ar . ue = wot and from 


equation (5) that sing = a : ue = wat hence 


@ = 5.18 radians. 


. For the point (5 v3 ) we know that 6 = 


3 1 
2/2’ 22’ J2 
7 and ¢ = %. It follows that the latitude of this 


‘ ‘ Po oe FE T FE * * 
point is 0° = 5 — 7 = 4, the colatitude is 


d= i and the longitude is @ = z. 


. For the point 45° W of the Greenwich meridian 


and 30° S, we have that 6 = 27 — 7 = es and 
6=F+%= an It follows that the point has 
spherical polar coordinates 
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(cos @ sin, sin @ sin 9, cos 0) Now, 


—_ Tm ot 2m ot Tm os, 20 Qn 
=> (cos 4 sin 3 , Sin 4 sin 3 , COS =) 


2 J ge all, 
V2 v2\ (3 Ja 
=((4) (4) ()--2) eo 
V2) 2)? Va 2)? 2 ent Ge le 0 ait 
- (4 _ v3 -}) v2 Me 2 
~ A272? 2/2? 27° 
so that the image of (1,0,0) under R (Y, a) is 
5. tot 
(35-4) 
Similarly, the image of (0, 1, 0) is (0, 1,0) and 
Y\ tant the image of (0,0, 1) is (+. 0, 43): 
feos Finally, since 
i. 9 4) 7-4 es 
V2 V2 vit VJ 
The tangents to the given two great circles form , : _ ~ | Via |]? 
two lines in the tangent plane at the point. V2 v2 J14 vi 


The two lines that bisect these angles are both 
tangent lines to great circles; these great cir- 


cles bisect the angles between the original great P gai eon 
circles. (4.-Fe 4a): 
bisector 2. Here 
cosy —siny 0 
R(Z,y)=|siny cosy 0 
0 0 1 
bisector and 
cosB 0 sinB 
R(Y, B) = 0 1 0 
_ —sinB O cosB 
If the angles between the original two tan- 
gents are a and #, then clearly 2a + 26 = 27. co 
From the above diagram it is obvious that the : 
angle between the two bisectors is 50 + 5 B, that ; 
is 5m, as required. cosy —siny 0 
R(Z,y)-R(Y,B) =| siny cosy 0O 
0 0 1 
Section 72 


cosB OO sin 


1. The matrix of the transformation R (Y, a) iS x 0 1 0 


—sinB O cosp 
‘ 1 1 
cosZ O sing v2 0 V2 cosycosB —siny cosy sinB 
0) 1 0 — 0 1 Of. =| sinycosB cosy  sinysinB }, 
—sint O cos] oo 


Va Vi — sin B 0 cos B 
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and 
cosB OO sing 
—snB 0 cosp 


cosy —siny 0 
x] siny cosy 0 
0 0 1 


cosBcosy —cosfsiny  sinB 
= sin y cos y 0 
—sinBcosy  sinBsiny  cosB 


3. The matrix of R(X, a) is 
1 0 0) 


A= {0 cosa -—sina]. Then 
QO sina cosa 


1 0) 0 
A’ =|[0 cosa sina 
QO —sina cosa 
1 0 0 
= |0 cos(—a) —sin(—a@) ], 
sin(—@) cos(—a@) 


which is the matrix of R(X, —a). 
The matrix of R(Y,B) is A= 
cosB OO sin 
0 1 0 
—sinB O cosp 


. Then 


cosB O —sinB 


AT={ 0 1 0 
sin6B O  cosp 
cos(—B) 0O- sin(—f) 
= 0 1 0 : 


—sin(—B) 0. cos(—f) 


which is the matrix of R(Y, —). 
The matrix of 
cosy —siny 0 
siny cosy 0O].Then 

0 0) 1 


R(Z,y) is A= 
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cosy siny O 


A’ =|-siny cosy 0 
0 0 1 

cos(—y) —sin(—y) 0O 

=| sin(-y) cos(-y) 0O 

0 0) 1 


which is the matrix of R(Z, —y). 


. As you saw in the solution to Problem 3 of 


i i 1 7 1 
Subsection 7.1.2, the point P (5 5a +3) 


has spherical polar coordinates of the form 


(cos @ sind, sing sin@,cos@), where @ = 5 
and 6 = 4; then also 0’ = 4 — 6 = 4. It fol- 


lows from Theorem 2 that the rotation that maps 
A(1,0,0) to P is 


A = R(Z,¢) R(Y,—0’) 


7 : 
cos —sin 3 0 
= x Em 
sin 3 COs 3 0 
0 0 1 
ria 5 is 
cos F QO -—sin Z 


IU ri 1s Iv i 
COs 3 COS F sin 3 cos 3 sin Z 
aii is rs : ‘4 
=| sinZcos% cos 4 sin % sin 7 
sin Z 0 cos 
1 V3 1 
2/2 2 2/2 
=|”! 1 _¥v¥3 
2/2 2 2/2 
as 0 tI 
J2 V2 
This matrix performs the desired rotation, 
since 
oe ee oie 
2/2 2 22 1 2V2 
ve 1 #3 | fo) a] “2 
2/2 «2 2/2 0 2/2 |’ 
+t 9 = 2 
V2 J2 J2 


as required. 
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5. As you saw in Problem 4, the matrix 6. The spherical polar coordinates of the 
1 J3 I point Q (4.-3.-4) are of the form 
2/2 a 2/2 (cos ¢ sind, sing sin6,cos@), where cos 9 = 
MA 1 3 wages _ 3a. 10 = 
OW 5 5/3 Ja it follows that 0 2 ree hence sin@ = 
ii 
tog 4 45,6 =$-0=$-¥ = -4, cosp= J, 
v2 v2 : 1 Tn 
: ; and sing = —~, so that @ = =. 
corresponds to a rotation of S* that maps It follows, i Theorem 2, that the rota- 
A(1,0,0) to (<5 B., 45) so its inverse tion of S? that maps A(1,0,0) to Q is 
(which is also its transpose) R(Z, i) R(Y, 4). Hence the rotation of S? 
i V3 i that maps Q to A(1, 0,0) is 
2/2 2/2 2 
v3 1 = 
“F 2 8 (R(Z,7)oR(¥.4)) 
1 V3 1 _ 1 
Wi Wi Vi = (R(¥,%)) | 0(R(Z,4)) 
corresponds to a rotation of S? that maps = R(Y,-4)oR(Z,-4); 
ed, Ad al 
(a ae 2) bee it: 
Now the matrix of the rotation that maps this has matrix 
A(1, 0, 0) to N(0, 0, 1) is (matrix of R(Y, 4))? x (matrix of R(Z, TENSE 
0 0 -!1 
R(Y,-3)=[0 1 O]; Ag SL\V ye, oh 
= 0 1 0 - ll 9 
so, by composition, the matrix of the rotation ‘ i v2 v2 
that maps P to N is V2 0 V2 0 0 1 
I V3 1 a 
Wl Wr Vi - = 
0 0 -!1 ne oe ee 0 
01 0 _v3 1 0 ~ 7 v2 v2 
10 0 oe 2-3 oF 
1 V3 1 > : v2 
2/2 2/2 V2 
1 V3 1 Therefore, by composition (using the result of 
2/2 2/2 v2 Problem 4), the matrix of the rotation of S 2 that 
= = 5 0 |. maps Q to P is 
<t. wae A A 
2/2 22 = V2 pee eee fee ee 
wW2 2 2/2 2 2 V2 
This matrix performs the desired rotation, since V3 1 V3 tooo 0 
Wi 2 ~wa|| v2 v2 
a ee i @ oof 4 
Y 2/2 V2 2/2 0 Va Va 2 2 Va 
ai 1 ci 
eee ee J. =| 2. 1 i 
22 w2 V2 v2 2/2 2/2 2 
: 1 1 
as required. ae ae 0 
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This matrix performs the desired rotation, since 


V3 V3 1 1 
2/2 2/2 2 2 
1 i V3 axl 
2/2 2/2 2 2 
1 1 als 
in ° 2 
oe 
2/2 
_— | v3 
~ 12/2 7? 
=e 
2 
as required. 
7. Here 
1 0 O 
dtA=]|]0 1 O|=-1. 
0 0 -!i 


8. First, since 37 + 47 + (—5)* = 50 = (sv2) . 
we write the equation of the plane m : 3x + 
4y -—5z = PL ee ny ae = a 
where a = ab = ia andc = at =e 

Then, by Theorem 3, the matrix associated with 

reflection in the plane z is 


1 18 24 30 
50 50 50 
24 1 32 40 
50 50 50 
30 40 1 50 
50 50 50 
16 12 3 
29: 5 5 
_2 9 4 
— 25 25 5 
3 4 
5 5 0 


9. A reflection of S* in a great circle is an isom- 
etry, so the composite of an even number of 
reflections in great circles is also an isometry of 
S*. Each reflection in a great circle reverses the 
orientation of S?, so the composite of an even 
number of reflections in great circles does not 
alter the orientation; hence the composite is a 
direct isometry. 

It follows from part (b) of Theorem 7 that the 
composite of an even number of reflections of 
S? in great circles is a rotation of S?. 
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Section 73 


1. 
N(0,0,1) 
ZT 


P (cos @, sin @, 0) 


Let A and WN be the points (1,0,0) and 
(0,0, 1), respectively; and let P be the point 
(cos ¢,sing@,0). Then the spherical triangle 
AAPN has angles 5, 4 and ¢, 
particular it has angular excess @. 

(a) Let @ = 
spherical triangle. 

(b) Let @ = 3.14 radians (note that 3.14 < z, so 
that the triangle AA PWN does not ‘cross back on 
itself’); this gives the required spherical triangle. 

2: 


so that in 


0.01 radians; this gives the required 


Triangle II Clearly ZQ’PR = a — a and 
ZPRO' = x — y. Also, by the symmetry of 
the intersections of the great circles Q’PQ and 
OROQ'R’, 
ZRQO' P = ZR'QP’ (by symmetry) 
= BP (vertically opposite). 


Triangle III Clearly ZR'PQ'’ = ZOQPR 
(vertically opposite) = a and ZPQ'R’ = x — 
ZPQO'R = x — f. Finally, by the symmetry of 
the intersections of the great circles RPR’ and 
ORO'R’, 

ZO'R’P = ZORP’ (by symmetry) 


=T-y. 
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3; 


4. 


Triangle IV Clearly ZQPR’ = zt — a, 
ZPQR' = x—f, and (from the result of Triangle 
Il) ZPR’'O =x—ZPR'O! =n-(n -y)=y. 


N(O, 0, 1) 


P(cos @, sin @, 0) 


Let A and WN be the points (1,0,0) and 
(0,0, 1), respectively; and let P be the point 
(cos ¢, sing, 0). 

Then the spherical triangle AA P N has angles 
5, % and @, so that in particular it has angular 
excess @ and so (by Theorem 3 of Subsec- 
tion 7.3.1) has area @. 

Let 6 = at radians; this gives the required 
spherical triangle. 


N(O, 0, 1) 


B(O, 1, 0) 


The spherical triangle AABN has angles 7, 5 
and 4, so that any dual triangle A’ has sides 
each of length (x — z) = 7. The sides of 
the spherical triangle AABN also each have 
length 5. 

Similarly, the spherical triangle AABN has 
sides 5, and 4, so that any dual triangle A’ 
has angles each of magnitude (x — 3) = 4. 
The angles of the spherical triangle AABN also 
each have magnitude 7. 

It follows that AABN and any dual triangle 
A’ each have angles of the same magnitude and 
sides of the same magnitude. So clearly AABN 


and the dual triangle A’ are congruent. 


5. 
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We use the result of Theorem 5 of Subsec- 
tion 7.3.3. 

The colatitude of Tokyo is 90° — 36° = 54°, 
so that the coordinates of the corresponding 
point P on S? are 
(cos(140°) sin(54°), sin(140°) sin(54°), 

cos(54°)) 

~ (—0.7660 - 0.8090, 0.6428 - 0.8090, 0.5878) 

~ (—0.6197, 0.5200, 0.5878). 


The colatitude of Rio de Janeiro is 90° + 
23° = 113°, so that the coordinates of the 
corresponding point Q on S? are 

(cos(—43°) sin(113°), sin(—43°) sin(113°), 

cos(113°)) 

~ (0.7314 - 0.9205, —0.6820 - 0.9205, 
— 0.3907) 

~ (0.6733, —0.6278, —0.3907). 

Hence the distance between Tokyo and Rio de 

Janeiro is approximately 4000d miles, where 
cos d = —0.6197 - 0.6733 — 0.5200 - 0.6278 
— 0.5878 - 0.3907 
~ —0.4172 — 0.3265 — 0.2297 
= —0.9734. 


Thus d ~ cos—~!(—0.9734) ~ 2.9104 radians, 
and the required distance is 


4000d =~ 4000 - 2.9104 ~ 11 642 miles. 


Remark 

To determine this distance in terms of metric 
measurements, we replace the figure of 4000 
miles above by 6378 km; thus the required 
distance is 


6378d ~ 6378 - 2.9104 
~ 18563 km. 
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6. 
Let d be the length of OR. By applying Pythago- 
ras’ Theorem to the triangle AP QR, we obtain 
cos 1.7 ~ cos 1.9 x cosd, 
so that 
d cos 1.7 
cosd & 
cos 1.9 
. —0.1288 
~ —0.3233 
~ 0.3985; 
it follows that 
d ~ cos! 0.3985 ~ 1.16. 
Te 


Let 6 denote the angle 7Q PR. We saw in Prob- 
lem 6 that OR ~ 1.16. It follows from the 
second formula in Theorem 7 that 


sin 1.16 
sin 1.7 


_ 0.9168 
~ 0.9917 
~ 0.9245; 


sin@ ~ 


it follows that 


6 ~ sin7! 0.9245 ~ 1.18 radians. 
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We use here the identities 

cos 2x = 1 —2sin? x 
and 

sin 2x = 2sinx cos x. 


For x € R, we deduce from the result of part (b) 
of Example 3 that 


1 
tan sa) 
cosa = 
tana 
sin (1a 
2 cosa 
cos (3) ae 
COs a cos a 
7 2 cos? (52) 1+ cosa 
Section 7.4 


1. (a) 


The inverse of the stereographic projec- 
tion applied to the point P(x, y) gives the 


coordinates 

my) =( 2 : 2 y , 
xe ye ee yee 
aeons 
x24 y241 

for P’. 


Then P” has coordinates 


2x 2y x*4+y?—1), 
P+ Pe ety yy 
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and the formula for stereographic projection 
gives the following coordinates for Q: 


2x 
(ape 


2y 
sprees) 


_ if y 
= x24 y2? x2 4 y2 . 


(b) The point (x,y) and (ata az) are 
inverse points with respect to the unit cir- 
cle x?7+ y? = | (see Theorem 2, Subsection 
5.1.2). 


P(2, 2) Q(3, 3) 


S 


The coordinates of P(2,2) are 
(2V2 cos 4, 2/2 sin T)s 

and the coordinates of Q(3, 3) are 
(3V2cos %,3V2sin 2). 


Then it follows from Theorem | that the spheri- 
cal distances SP’ and SQ’ are 


2tan7! (2v2) and 2tan7! (3v2) 
respectively. But S,N,P’ and Q’ all lie in 


the same vertical plane, so that the spherical 
distance P’Q’ is 
(spherical distance SQ’) — (spherical distance 
SP’) 
= 2tan! (3v2) —2tan7! (2v2) : 
It then follows from the identity 
pea 
1+xy’ 
for x,y € R, that the required distance can be 
expressed in the form 


tan} x — tan”! y =tan- 


32 =, 
i+ (3v2) (2v2) 


2tan~! 
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2 
= 2tan~! (4) ~ 0.2167. 


3. Let one of the fixed points of the rotation of 
S? be P’(X,Y,Z); then the other must be 
Q'(—X,—Y, —Z). Under stereographic projec- 
xX sy 


=X =Y \s. m2 : , 
and Q (= iy) in R~, respectively; we can 


tion these map to the points P 


express these coordinates as complex numbers 


_ X+i¥ ot ae 
s=pfztipg = Tzant = Gt 
4 —X-iY 
'THZ I+Z 
Then 
4 X+iY —X—-iY 
Ss: = ‘ 
1-Z 1+Z 
_X-iY -X-iY 
~ "T= 2 14+Z 
a= 
— —_ i 
1-22 


since X2 + Y2 + Z* = 1. It follows that t = 
—-I/s. 


4. Arotation R(X, a) of S? fixes the points (1, 0,0) 


and (—1,0,0), which map under stereographic 
projection to the points | and —1, respectively. 

By Theorem 3, a conjugate Mobius transfor- 
mation must be of the form 


(where we use c and d to avoid using the letters 
a and b for different things). 
This mapping fixes | and —1 if 


d _ d 
io ag ate a | 
—-d+c d+c 
which we can rewrite in the form 
-—dt+éGé=c+d and —d-—é=-c+d. 


Adding these equations gives —2d = 2d, so that 
d is imaginary, and subtracting the equations 
gives 2c = 2c, so that c is real. 

It follows that M has the desired form 
M(z) = a with a in place of c and ib in 
place of d. 
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5. There are no fixed points of the Mobius trans- 


formation z +» z+ 1 in C, since the equation 
z = z+ 1 has no solutions in C. However the 
Mobius transformation maps oo to itself, so co 
is a fixed point in C of the transformation. 

The fixed points in C of the Mobius trans- 


formation z > 4 are the solutions of the 


z+4 
equation z = =. We may rewrite this equation 
in the form z* + 4z +4 = 0, or (z +2)” = 0, so 
that —2 is the only fixed point in C of the trans- 
formation. Since co +> 0, —2 is also the only 
fixed point in C of the transformation. 

The fixed points in C of the Mobius trans- 
formation z b> ss are the solutions of the 
equation z = =. We may rewrite this equation 
in the form z7+5z+4 = 0, or (z+4)(z+1) = 0, 
so that —4 and —1 are the fixed points in C 
of the transformation. Since co - O,—4 and 
—1 are also the only fixed points in C of the 
transformation. 


. Let C denote the circle in R* with equation 
x? + y? 4+ 2ax + 2By+y =0, 


which we may rewrite in the more convenient 
form 


(x7 + y? +1) + 20x + 2By+(y —1) =0 
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or 
l+a = +p a4 
oe ee a ea 
y-1 
x2+y24+1 ~ 


Now under the inverse mapping 27! 


(x,y) H (X,Y, Z) of stereographic projection, 


x= 2x 
x24 y2 417 
2 
Vea, 
x2+y24]1 
and 
x+y? 1 
x24 y24 17 
in particular, Z = 1 — Peers so. that 
Pe 


It follows that C corresponds under the 
inverse mapping ~! of stereographic projec- 
tion to a plane with equation 


1+aX+ BY +3(y-DU-Z)=0, 
or 


aX +2~Y+(U—-y)Z+(+y)=0. 
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A(2), 71 
affine 
classification of conics, 108 
group, 71, 471 
properties, 73 
transformation, 71 
transformations, basic properties, 73 
affine geometry, 73 
congruence, 86, 111 
Fundamental Theorem, 86 
rectangular hyperbola, 116 
Alberti, Leone Battistuta, 130 
altitude, 391, 447 
Altitude Theorem, 391 
angle, 273, 351 
of lune, 441 
of parallelism, 395 
sum, 383, 384, 388, 439 
Angle Theorem, 274 
angular defect, 406 
angular excess, 440 
antipodal point, 425 
Apollonian circles, 261, 317 
family of, 318 
Apollonian Circles Theorem, 317, 319 
Apollonius of Perga, | 
arbelos, 330 
Archimedes, xi 
area 
hyperbolic, 401 
spherical, 441 
argument, 277 
associated circle and plane, 459 
asymptote, 17, 474 
asymptotic triangle, 388 
Aubel’s Theorem, van, 120 
auxiliary circle, 32 
axes 
ellipse, 15 
hyperbola, 17 
parabola, 13 


B? — 4AC Test, 42, 213 
barycentric coordinates, 102 


collinearity, 104 

equation of line, 104 

Section Formula, 105 
Bayeux tapestry, 128 
Beltrami—Klein model, 478 
Bessel, Friedrich Wilhelm, 345 
Bolyai, Janos, 3, 344 
boundary point, 346 
Brannan, Michael, xiv 
Brianchon, Charles J., 177 
Brianchon’s Theorem, 177, 252 
Brisbane Exhibition and Conference 

Centre, 52 

Brunelleschi, 129 


camera, aerial, 189 
canonical form, 363, 365 
Cassegrain focus, 32 
centre 

of inversion, 263 

of projective reflection, 475 
Ceva, Giovanni, 93 
Ceva’s Theorem, 94, 107 

converse, 97, 108 
circle, 6 

auxiliary, 32 

equation, 7 

generalized, 285 

great, 426 

hyperbolic, 373 

little, 426 

of Apollonius, 317 

of inversion, 263 
circles 

intersecting, 9, 321 

kissing, 335 

orthogonal, 9 

Soddy, 335 
coaxal circles, 317, 322, 458 
Coaxal Circles Theorem, 322 
colatitude, 428 
collinearity in RP?, 141, 142, 145 
common perpendicular, 388 
Common Perpendiculars Theorem, 388 


583 


584 


compass bearing, constant, 463 
complex number, 276 
computer graphics, xii, 127 
Concentricity Theorem, 326 
congruence, 69, 163, 215, 316, 385 
affine, 111 
Euclidean, 69 
G-congruence, 70 
conic, 6 
classification, 41, 42 
focus—directrix definition, 11 
general equation, 36 
matrix representation, 37 
polar equation, 18 
projective, 202, 205 
section, 6, 22, 210 
section, degenerate, 6 
section, non-degenerate, 6 
tangent, 23, 25 
conjugate diameters, 78 
Conjugate Diameters Theorem, 78, 116 
conjugate transformation, 454 
cooling tower, 48 
Cosine Rule, 449 
Coxeter, H. S. M., 409 
cross-ratio, 179, 180, 187, 188, 475 
on embedding plane, 187 


Dandelin, Germinal Pierre, 22 
spheres, 22, 53 
d-congruent, 385 
Desargues, Girard, 1, 2 
Desargues’ Theorem, 135, 173 
Descartes, René, | 
diagonal vanishing point, 134 
Difference of Focal Distances Property, 20 
directed distances, 52 
direct transformation, 362, 365, 429 
directrix, 12 
discriminant, 111 
distance, 7, 367 
function, 367, 368 
hyperbolic, 368, 369, 380, 481 
in R*,7 
spherical, 426, 444 
d-line, 346, 347 
boundary point, 346 
doubling map, 81, 297 
doubly asymptotic triangle, 388, 401 
d-point, 345 
d-quadrilateral, 384 
d-triangle, 383 
Dual Triangles Theorem, 443 
duality, 176, 250 
Duality, Principle of, 178, 250 
Duccio, 129 
Diirer, Albrecht, 130 


eccentricity, 12 
Eccentricity Formula, 209 


Index 


Ecole Polytechnique, 2 
electrostatics, 2 
Elements of Euclid, 1 
ellipse, 14 
parametric representation, 15 
Reflection Property, 29 
standard form, 15 
embedding plane, 147, 148 
standard, 148 
envelope, 32 
equatorial plane, 428, 433 
equidistant curves, 399 
equivalence relation, 69 
Erlangen Programme, 4, 478 
Escher, Maurits Cornelis, 409 
Euclid, xi 
Euclidean, 62 
congruence, 68 
geometry, 62 
properties, 64 
transformation, 66 
extended 
complex plane, 283 
conjugation function, 286 
line, 285 
linear function, 287 
plane, 284 
reciprocal function, 287 
External Reflection Property, 31 


families of circles, 321 
coaxal, 317, 322, 458 
Fermat, Pierre de, 1 
Five Points Theorem, 230 
focal chord, 13 
focal distances 
ellipse, 19 
hyperbola, 20 
focus, 12, 32 
foreshortening, 134 
function 
extended conjugation, 286 
extended linear, 287 
extended reciprocal, 287 
linear, 282 
notation, xiii 
reciprocal, 283 
Fundamental Theorem 
Affine Geometry, 88, 93 
Inversive Geometry, 310, 312 
Projective Geometry, 162, 172 


Gg, 350 
Gauss, Carl Friedrich, 344, 345, 405 
generalized circle, 285 
generating lines, 49 
generator of cone, 210 
geodesic, 377, 409, 412, 426 
geometry, 61 
affine, 73 


Index 


elliptic, 343, 424 
Euclidean, 62 
hyperbolic, 345, 412 
model, 4 
non-Euclidean, 344 
Giotto, 129 
Goethe, Johann Wolfgang, xi 
gradient, 13 
great circle, 426 
Greenwich meridian, 426 


half-plane model, 412 
hemisphere, 483 
hexagon, 177 
hierarchy, 475 
Hilbert, David, 485 
homogeneous coordinates, 138 
homogeneous matrix, 471 
horizon, 135 
horosphere, 485 
hyperbola, 16 
asymptotes, 17 
parametric representation, 18 
rectangular, 18 
Reflection Property, 30 
standard form, 18 
hyperbolic 
area, 401 
centre, 373 
circle, 373 
distance, 368, 369, 380, 481 
geometry, 345, 412 
group Gg, 350 
limit rotation, 363 
midpoint, 371 
paraboloid, 43, 51 
parallel postulate, 344 
radius, 373 
reflection, 350, 379 
rotation, 362 
transformation, 350 
translation, 363 
triangle, 383 
hyperboloid of one sheet, 43, 48 


ideal Line, 148 
ideal Point, 148, 188 
Incidence Property of RP2, 146 
indirect transformation, 362, 429 
infinity, point at, 284 
inside of triangle on S 2. 440 
Internal Reflection Property, 31 
inverse, 263 
inversion, 262, 265 
inversive 

geometry, 295 

group, 296 

transformation, 295 
Inversive Geometry, Fundamental 

Theorem, 310, 312 


isometry, 63, 278, 429 
Isosceles Triangle Theorem, 437 


Joachimsthal, 216 
notation, 218, 227 
Section Equation, 219 


kaleidoscope, 409 

kiss precise, 335 
kissing circles, 335 
Klein, Christian Felix, 4 
Kleinian view, 61 


La Hire’s Theorem, 225, 244 
Lady’s and Gentleman’s Diary, xi 
latitude, 428 
length on S2, 426 
limit rotation, 363 
line on S2, 426 
Line, 141 
Conic, 251 
equation of, 141, 144 
ideal, 148 
linear function, 282 
linearization map, 478 
Little Chef, restaurant, 52 
little circle, 426 


Lobachevskii, Nikolai Ivanovich, 3, 344 


Lobachevskii’s Formula, 394, 398 
longitude, 428 

Lovell radio-telescope, 29 

lune, 441 


map, planar, 460 
Martini, Simone, 129 
matrix of rotation, 38, 429 
Median Theorem, 93 
Menelaus of Alexandria, 93 
Menelaus’ Theorem, 98, 106 
Converse, 101, 106 
Mercator, 463 
meridian, 426, 461 
midpoint, 78 
hyperbolic, 371 
Midpoint Theorem, 78 
Mobius, August Ferdinand, 3 
band, 3 
transformation, 298, 454 
model (of a geometry), 4 
disc, 345 
elliptic, 424 
half-plane, 412 
hyperbolic, 345 
spherical, 425 
modulus, 276 
Monge, Gaspard, 2, 177 


Newton, Isaac, | 
non-Euclidean, 344 
normal, 26 
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586 


oblate spheroid, 460 

Origin Lemma, 351 

orthogonal, 9, 65, 322, 432 
outside of triangle on S2, 440, 448 


P(2), 152 
Pappus’ Theorem, 175, 177, 186 
parabola, 12 
parametric representation, 13 
Reflection Property, 31 
standard form, 13 
vertex, 13 
parallel, definition, 349 
parallel postulate 
elliptic, 344 
Euclidean, 343 
Hyperbolic, 344 
parallel projection, 74 
basic properties, 75 
composition, 82 
parallels, 461 
parameter of projective reflection, 475 
Parametrization Theorem, 237, 246 
Pascal, Blaise, 2 
Pascal’s Theorem, 238 
converse, 240 
perpendicular, 388 
common, 388 
on S”, 448 
perspective, 128 
focused, 129 
terraced, 128 
transformation, 166, 167 
vertical, 129 
perspectivity, 131 
Perspectivity Theorem, 172 
Plato, xi 
Poincaré, Jules Henri, 4 
point 
at infinity, 284 
circle, 318 
Point, 137 
conic, 251 
ideal, 148, 188 
unit, 146 
polar, 26, 223, 224, 226 
equation of conic, 18 
form, 277 
pole, 224 
Pole, 425 
Poncelet, Jean Victor, 2 
Poncelet’s porism, 329 
porism, 328 
power of a point, 52 
principal 
argument, 277 
vanishing point, 134 
Principle of Duality, 178, 250 
Product Theorem, 436 
projection, 461 
azimuthal, 462 


Index 


conformal, 462 

conical, 463 

cylindrical, 462 

equal area, 463 

gnomic, 462 

Mercator, 463 

orthogonal, 462 

orthographic, 462 

parallel, 74 

stereographic, 290, 450, 462 
projective 

congruence, 163 

figure, 140 

geometry, 156 

hierarchy, 475 

line, 141 

plane, 137 

point, 137 

properties, 156 

reflection, 475 

transformation, 151 
projective conic, 202, 205 

degenerate, 205 

inside, 207 

non-degenerate, 205 

outside, 207 

standard form, 237, 242 

tangent, 207 
Projective Geometry, Fundamental 

Theorem, 162, 172 

Ptolemy’s Theorem, 330 
punctured, definition, 268 
Pythagoras’ Theorem, 392, 396, 445 


quadric, 42 
classification, 45 
degenerate, 42 
matrix representation, 44 
surface, 42, 43 
quadrilateral, 162, 384 


radical axis, 321 
railroad lines, 134 
real projective plane, 137 
reciprocal function, 283 
rectangular hyperbola, 18 
reflection, 28 

hyperbolic, 361, 379, 411, 480 

Law, 28 

Lemma, 379 
Reflection Property 

ellipse, 29 

hyperbola, 30 

parabola, 31 
Riemann sphere, 290, 450 
Roque de los Muchachos Observatory, 31 
rotation, 38, 278 

elementary, 429 

hyperbolic, 362 

matrix, 429 


Index 


RP?, 137 

RP3, 471 

ruled surface, 48 
generators, 49 


scaling, 280, 461 
Scorer, Sam, 51 
Section Equation, Joachimsthal’s, 
219 
Section Formula, 105, 496 
self-dual, definition, 176 
self-inverse, 264 
shoemaker’s knife, 330 
simply asymptotic, 388 
Sine Formula, 29, 396 
Sine Rule, 448 
slope, 13 
Soddy, Frederick, 335 
sphere, Riemann, 290, 450 
spherical 
distance, 426, 444 
geometry, 425 
isometry, 429 
polar coordinates, 428 
triangle, 437, 438 
trigonometry, 438 
Sputnik, xi 
standard 
embedding plane, 148 
projective conic, 237, 242 
Steiner, Jakob, 328 
Steiner’s porism, 328 
stereographic projection, 290, 450, 
462 
strict line segment, 440 
strict triangle, 440 
subgeometry, 472 
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Sum of Focal Distances Property, 19 
Symmetry Lemma, 273 


tangent, 23, 207, 216, 225, 474 
pair, 222, 226 
to conic in standard form, 114 
tapestry, Bayeux, 128 
telescope, 31 
tessellation, 408 
hyperbolic, 409 
Three Points Theorem, 233 
Three Tangents and Three Chords 
Theorem, 244 
Three Tangents Theorem, 216 
translation, 277 
trebly asymptotic, 388, 401, 402 
triangle 
dual, 442 
spherical, 438 
triangle of reference, 146 
Triangle Inequality, 367 
Two Apollonian Circles Theorem, 327 


ultra-parallel, definition, 349 
Unique Fourth Point Theorem, 185 
unit Point, 146 


van Aubel’s Theorem, 120 
vanishing line, 135 
vanishing point 
diagonal, 134 
principal, 134 
vertex of parabola, 13 
Vinci, Leonardo da, 130 


wallpaper patterns, 408 


